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Chapter 1 


Introduction, Motivation and Background 


This thesis deals with the topic of ultrafast, ali-optical switching and logic using spatial and spatio-temporal solitons. The 
main focus is on the accurate analytical and numerical modeling of the interaction among solitons in geometries that allow 
for logically-complete, cascadable logic gates with fanout and level restoration. Detailed modeling will assist in subsequent 
experimental investigation by identifying regions of stability, robust operation and reliability, and candidate material systems. 

The work in this thesis makes significant contributions in the areas of fundamental nonlinear spatio-temporal propagation 
and optical switching and logic. The first contribution is the result of a multiple-scales derivation directly from Maxwell’s 
equations, which results in a first-order, multi-dimensional, vectorial differential equation that is accurate beyond the standard 
paraxial and slowly-varying envelope and amplitude approximations. This result is fundamental to the studies of this thesis 
and will have application in other areas as well, such as optical communications, short-pulse passively mode-locked lasers, and 
atmo.sphcric pulse propagation. 

The second contribution arises from the systems-level approach taken to optical switching and logic. Most optical switching 
technologies do not satisfy the basic requirements for a logic gate, thereby rendering them of little use in applications beyond 
simple, single-stage switching operations. The novel logic gales studied here satisfy the necessary requirements and thus have 
the potential lo make an impact in areas of ultra-high speed switching and logic systems. The desirable properties of these logic 
gales are faciliiaied by the non-diffracting and/or non-dispersing nature of optical solitons. 

LI Outline 

The remainder of this chapter briefly presents background in device requirements and optical switching and logic. Section 1.2 
provides motivation and a brief overview of promising applications for optical logic. Section 1.3 covers generic requirements 
for sw Itching and logic devices and additional specific requirements for optical devices. A brief review of other contemporary 
optical switching devices and their shortcomings is presented in section 1.4. Logic devices which specifically take advantage 
of the properties of optical solitons are discussed in section 1.5. which also provides an introduction to the optical soliton logic 
gaies.siudied in this thesis. 

Detailed backgri'^und on optical solitons is given in Chapter 2. First, section 2.1 provides historical background in solitary 
w ave and soliton phenomena. Section 2.2 covers some preliminary details that lead directly to the discussion of optical solitons 
and solitary waves. The following sections then discuss spatial (sec. 2.3). temporal (sec. 2.4), and spatio-temporal (sec. 2.5) 
Optical solitons. 

Chapter 3 derives the multi-dimensional vector w ave equations necessary for the numerical simulations of the soliton-based 
logic gates presented in later chapters. Section 3.1 covers a standard treatment of a fully vectorial, nonlinear Helmholtz- 
type wave ecjuation along with background on the linear and nonlinear susceptibilities. This second-order wave equation is 
unsuitable for the purposes of this thesis due to the difficulty in numerically propagating an initial field distribution because 
of the fast time and space scales in the equation. Instead, section 3.2 derives an asymptotic vector nonlinear wave equation 
directly from Maxwell's equations which is first-order in the propagation coordinate and depends only on the scales of the 
slowly-varying envelope, and is thus more suitable for numerical propagation. The importance of the results obtained in this 
chapter arises from the higher-order nature of the derived equation, which extends beyond the approximations made in the 
well-known multi-dimensional nonlinear Schrodinger (NLS) equation, resulting in the capability to describe propagation of 
vector optical field envelopes with very broad spectral content in both the spatial and temporal frequency domains, including 
the effects of higher-order nonlinearities. 

Chapter 4 describes the split-step numerical method used to solve the vector nonlinear wave equations. The basis of 
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this method is to treat linear and nonlinear (inhomogeneous) propagation in separate steps, performing linear propagation 
in its natural Fourier domain and nonlinear propagation in the real-space domain. For a small step size, this splitting is a 
good approximation. Section 4.1 covers multi-dimensional linear spatio-temporal diffraction, which can be solved without 
approximation as an initial value problem. Noting that linear and nonlinear propagation are not separable, section 4.2 derives a 
split-step formulation which is approximately second-order accurate in step size. This formulation is then applied separately to 
(1+1)-D spatial and (2 -h 1)-D spatio-temporal nonlinear propagation, to be used in later chapters. Finally, section 4.3 discusses 
the accuracy of the method as it applies to problems of later chapters. 

The use of one-dimensional spatial soliton interactions for logic gates is presented in Chapter 5. The first section covers the 
basic soliton interaction geometries useful for three-terminal, restoring logic, noting that the collision and dragging geometries 
using orthogonally polarized solitons, which are of the general class of angular deflection gates, provide the best performance 
in terms of large gain with high contrast. Section 5.3 then examines the effects of linear and two-photon absorption on the 
propagation of a single spatial soliton and develops appropriate figures-of-merit for use in comparing the suitability of nonlinear 
materials for soliton logic gates. The collision and dragging logic gates are then compared in the presence of absorption using 
the material parameters of fused silica, where the spatial dragging gates generally perform better because of their shorter gate 
lengths. The final section (5.4) computes the transfer functions of the collision and dragging gates and determines the maximum 
fanout that these gates can provide in a cascaded system. The results of this chapter provide guidance for the spatio-temporal 
logic gates of the next chapter, which is of ultimate interest. 

" Logic gates based on the interactions between two-dimensional spatio-temporal solitary waves is the subject of Chapter 6, 
which have the potential for greater than THz switching rates with nJ to pJ switching energies. Section 6.1 discusses the 
propagation of a single spatio-temporal solitary wave with higher-order linear and nonlinear effects as derived in Chapter 4. 
This section shows numerically that stabilized solitary waves suitable for logic gates should exist. With these results, section 6.2 
studies the vectorial interaction between these spatio-temporal solitary waves with emphasis on the dragging interaction. For 
completeness, section 6.3 presents results on cascaded logic. It is shown that the dragging logic gates can provide reasonable 
fanout in a cascaded, logically-complete, system, but also that, ultimately, the performance of the logic gate will be limited by 
the effects of Raman scattering. 

Finally. Chapter 7 concludes, noting that this thesis provides the theoretical and numerical development that is necessary to 
study a novel class of all-optical logic gates. Numerical simulations have shown that these gates satisfy the system requirements 
that sueuest their use beyond simple, single-stage operation, and paves the way for future theoretical and experimental work on 
ultrafa.st. all-optical logic systems. 


1.2 Applications in Switching and Computing 

The traditional advantages that optics hold over electronics are the inherent speed, parallelism, and lack of inductive or ca¬ 
pacitive crosstalk, although crosstalk does occur to some extent in linear propagation due to scattering and diffraction, and in 
nonlinear propagation, which plays an important role in the topic of this thesis. These properties allow linear optics to map 
well onto problems in interconnection [1]. It is generally accepted that optical interconnects will replace electric transmission 
lines in many applications. Indeed, this is already occurring in long-haul communications networks and is expected to occur 
in short-haul and local-area networks as well. It is an open question to determine how far down the interconnect hierarchy this 
trend will reach, but it is fairly clear that it will extend to system level switching fabrics. In all of these cases though, the data 
is actually switched electronically, or under electronic control, and processed electronically. 

It is less clear vv hat role optics will play in switching and digital computing however. The soliton-based logic devices studied 
in this thesis are one approach to all-optical logic for switching and computing applications. Numerous all-optical switching 
and logic devices have been proposed, but none so far have reached the stage of practicality for real systems. There are certain 
minimum requirements for a logic device, and many of the optical approaches to logic fail to meet them. These requirements 
are discussed in section 1.3 and some contemporary optical switching devices and their failings are discussed in section 1.4. 
A clear understanding of these failings aids the present study of optical soliton-based logic devices. As section 1.5 explains, 
the soliton devices completely satisfy the fundamental requirements (and most of the practical requirements as well) for digital 
logic 12,3) and might succeed where the other optical approaches have failed. 

The following sections discuss two general areas for which restoring, cascadable logic gates, such as those studied in later 
chapters of this thesis, might find widespread application. 

1.2.1 Optical Communications Networks 

With the advent of high-bandwidth optical communications [4], high-speed switching technology becomes a necessity. Because 
information will be transmitted optically, it makes sense to explore optical technologies and devices to perform (or at least assist 
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in) the switching necessary to multiplex, demultiplex, and route information to the correct destination. Future bandwidth needs 
will be too great for an electronic-only solution to the switching problem. An advantage of optical devices in communications 
systems is the ability to switch and/or perform logic at the bit rate. This means that switching and logic operations are performed 
at and scale with the rate of incoming data, which is especially important in Tb/s communications switching and routing 
applications. Conversely, interconnected electronic gates cannot arbitrarily scale with the data rates and may ultimately be 
limited to speeds of about 50 Gbit/s [5]. 

Optical switching and logic devices are well suited for time-division multiplexed (TDM) data transmission [6], in which 
many low bandwidth channels are interleaved into individual time slots within a single high bandwidth channel. Here, the 
aggregate data rate may exceed 1(K) Gb/s [7], which is well beyond the performance expectation of practical, low power, 
electronic or optoelectronic switching networks. Therefore, at the very least, optical switching technology will find use in the 
intermediate processing layer in which the individual channels are optically separated (demultiplexed) from the single, high¬ 
speed transmission channel, for subsequent processing in the electronic/optoelectronic domain. An additional area of use is in 
data regeneration within an optical transmission line, which requires an optical logic gate. Given sufficient noise margins, an 
opticallogic gate regenerates a noisy or attenuated data stream with fresh pulses that are derived from a power supply or clock, 
such that timing, amplitude, shape, etc., are completely restored. This type of all-optical repeater may eliminate the need for 
high-speed, high-power electronic repeaters in long-haul transmission. 

Currently, the most popular multiplexing technique for long-haul communications and local-area optical networks is wave¬ 
length division multiplexing (WDM), in which each low bandwidth source channel is assigned a slot in the optical frequency 
spectrum. WDM has the advantage that multiplexing and demultiplexing can be performed by spectrally selective, passive, 
optical elements. However, due to the difficulties in developing a large array of stable wavelength sources, the individual 
channels in a WDM system may exceed data rates of 10 Gb/s [8,9], which may be of sufficient bandwidths that other advan¬ 
tages of optical switching and logic devices, such as the elimination of the optoelectronic conversion process [10], potentially 
lower power, and the use of deep optical circulating delay lines [II] for contention resolution, may play a role in the choice of 
implementation technology. 

More complex operations are required in packet switched optical networks [10,12], In these networks, at each node, a 
packet header is decoded, which determines the direction in which to route the packet. Header recognition is a simple digital 
correlation operation, but must be performed at the bit rate. Typical implementations based on optical switches [13] perform 
this operation in parallel, which results in a 1//^ loss where N is the number of bits in the header. In order to reduce the latency 
at each node and avoid the fanout loss, a digital comparator, or shift register, could be used instead, based on optical logic gates. 
In addition, optical logic could be used for contention resolution, real-time encryption/decryption, and other complex tasks as 
well. Therefore, intelligent digital optical processing may'play a significant role in the development in the next generation of 
high-speed optical communications networks. 

1.2.2 Digital Optical Computing and Processing 

Determining what role optics should play in general-purpose computing is an open question. The technological lead, resources, 
and infrastructures that electronics enjoy may be insurmountable for general-purpose optical computing. Even a compelling 
optical technology may not be enough to impact the future of digital computing, but there is potential in niche areas which are 
discussed in this section. 

Simple digital optical logic circuits have been demonstrated [14,15] and proposed [16], which pave the way for more 
complex systems. However, many of these studies have been based on optical switches or gates that do not completely satisfy 
the requirements for digital logic. For example, the nonlinear Fabry-Perot etalon [17] is a two-terminal device which must be 
biased about an operating point with a holding beam and is very sensitive to variations in device parameter (i.e. the transfer 
characteristics vary from device to device). All of these factors preclude their use in complex systems. The logic gates studied 
in this thesis, on the other hand, do not suffer from these problems and could serve as fundamental building blocks in more 
complex systems. 

In the absence of high space-bandwidth optical interconnections, bit-serial computation and signal processing [18] is a 
promising application area, especially when the problems of interest can be processed in a highly pipelined manner [19]. 
Genera! purpose optical computers [20] have even been demonstrated using the bit-serial approach. One potential limitation 
for ultra-fast digital optical computing is the memory store. It has been shown, however, that an optical delay line can be 
used to implement a general machine [16], at the cost of increased latency. These tradeoffs are based on the transformations 
of computational origami [21]. Even more computational capacity can be realized with the combination of optical logic gates 
with optical interconnection, for which the power of the interconnection network allows for the efficient mapping of problems 
that would be difficult to process electronically [22]. 
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1.3 Requirements for Digital Switching and Logic Devices 

Routing switches and logic elements must possess certain properties in order to function properly within a system. The major 
distinction made here between switching elements and logic gates is that switching elements physically direct electrons or 
photons from one or more input ports to one or more output ports, while logic gates replace electrons or photons on the input 
ports with “new” electrons or photons derived from a power supply which then appear at the output ports. The requirements 
for a switching device are much less stringent than those for a logic device and will be discussed first. Additional requirements 
for an optical logic device are discussed in the following section. 

1.3.1 Switching Devices 

Optical routing switches are typically used Just in the first switching stage to reduce data bandwidth to a level that electronics 
can handle in subsequent stages. These switching elements use a control to physically direct light from one or more input 
ports to one or more output ports and thus can be used to implement multiplexing and demultiplexing functionality. The 
routing decision is based either on the intensity of the signal inputs or by the presence of an externally derived control which 
is inde%ndent of the switching fabric. Therefore, the control is typically not of the same format as the data, meaning that the 
output of the switch cannot directly serve as the control for another switch [3]. As a result, this type of routing switch has fewer 
requirements than a logic gate. 

The most important requirement of a switching device is high contrast operation. In binary transmission, the contrast 
determines the threshold level which separates the high and low states. Higher contrast gives larger noise margin, resulting in 
lower bit-error-rate (BER). For the output of a switching element, the overall contrast depends both on the contrast of the data 
stream (the difference in signal level between the two binary states which depends on the modulation ratio at the input of the 
tran.smission line and on the transmission line itself) and the contrast of the switching operation. 

Another important requirement is switch transparency. If the switch is lossy or only a small fraction of the input is diverted to 
the output by the switching action, then high-speed, single-shot detection may become impossible, resulting in information loss. 
This does not mean that the switching operation must provide gain, though. For high-contrast output, saturation of an external 
amplification stage can be used to restore the data signal levels, but the gain recovery time may limit the data rate and introduce 
inter-symbol interference, the contrast at the output of the amplifier may be reduced, and complete logic level restoration (as 
discus.sed later for optical devices) is not obtained. The amount of gain is also limited due to amplified-spontaneous emission 
(ASE) noise, which increases the noise floor. 

Routing sw itches do not have signal level restoration (in the absence of external amplification or level shifting) and the 
outputs may degrade due to loss or crosstalk and therefore BER may suffer from a long cascade of switches. If data pass 
through many levels of switching, such as in a multi-level implementation of an N:N crossbar or the binary tree structure 
required for 1:N or N:1 multiplexors or demultiplexors constructed from 1:2 or 2:1 switches, then gain and level restoration 
as mentioned previously may be a necessity, subject to the limitations discussed in addition to ASE noise. Three-terminal 
operation al.so becomes necessary when many levels of switching elements are used such that the operation of the switch in 
unidirectional, w hich prevents any light entering the output ports from affecting the operation of the switch. 

Therefore, in order to implement more complex, multi-level switching fabrics, or to handle data-dependent (i.e. self-routing) 
sw itching operations, sw itching elements must have the additional properties and fulfill the more demanding requirements of a 
logic gate. It is in these applications that most all-optical switching devices are inadequate, as discussed in section 1.4. 

1.3.2 Logic Devices 

A digital logic gate performs a Boolean operation on one or more binary inputs. All inputs and outputs are of the same physical 
format thus allowing control to be distributed throughout the switching fabric [3] such that one data stream can control another. 
The logic gate completely restores signal integrity and timing by replacing the electrons or photons at the input with new 
electrons or photons from the power supply that go to the output. 

Logic levels (i.e. 0 or 1 in digital logic) are physically represented by signal levels, which may be voltage or current 
for electronics; or for the representationally richer case of optical logic, amplitude, phase, polarization, or color, and are 
differentiated by a threshold nonlinearity. Information is carried by the logic level and is determined by the interpretation 
of the signal level (based on the threshold). Since the signal level can be altered during a logic operation, small noise or loss 
can accumulate throughout the computation and cause information loss. A digital representation of information avoids this 
degradation if the signal level is restandardized at each step [23]. Restandardization means that the signal levels are restored to 
values (that are the same throughout the system) which represent valid logic states. As a result, a small deviation about a valid 
logic level propagates at most one stage and the logic level remains intact (within the allowable noise margin). The is termed 
logic level restoration. 
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nMOS inverter transfer function 



Fieure 1.1: nMOS inverter circuit and associated transfer function. Near the threshold voltage is a region of small-signal gain. 
Large-signal gain is also provided (Vdd > Kh) and the high and low output states are saturated. 


The properties required by a digital electronic [24] or optical [25] logic device are well known. Perhaps the most important 
requirements, upon which many other requirements depend, are gain, saturation, and threshold. These characteristics are illus¬ 
trated in Figure 1.1, which shows the sigmoidal input-output relationship for a simple nMOS inverter. At the most fundamental 
level, a region ( typically near the input threshold) of differential or small signal gain in the input-output transfer function, where 
a small change in the input produces a large change in the output, is required (but not sufficient) for a restoring logic gate [26]. 
Small-signal gain provides sharp switching characteristics and allows for low modulation depth on the input signal biased about 
the threshold level to produce high contrast switching. Outside the linear region of small-signal gain, saturated levels are key 
to providing large noise margin by attenuating small variations in the input about valid logic levels (i.e. inputs lying within the 
"on" and "off regions shown in the figure), thus producing little change in the output. 

Restoration of the logic level prevents accumulation of errors due to attenuation or crosstalk by providing fresh gate output 
levels directly from the power supply. In the case of an optical gate, logic level restoration must include, in addition to power 
levels, beam shape and position, pulse duration, color, polarization, and timing [25]. These levels are standardized throughout 
the entire system typically by the common power supply and ground. Standardization is possible because small-signal gain and 
saturated levels in the nonlinear transfer function allow for a wide tolerance to the variation in operational characteristics of the 
devices in the system [24]. 

Large-signal gam means that the output of the gate is larger than the input required (at least threshold) to switch the gate. 
Without large-signal gain, fanout, in which the output of a gate provides inputs to many gates in order to implement arbitrary 
logic functions, Is impossible. Most optical logic gates cannot intrinsically provide large-signal gain, and in fact, the output is 
typically much smaller than the input. Although a separate, external amplification stage can be used, this may limit the bit rate 
and result in reduced contrast and increased BER when the amplifier noise exceeds the device noise margins. A logic gate with 
intrinsic gain, in which a small input controls a large power supply, does not suffer this source of noise. 

Additional requirements for synthesis of arbitrary logic and switching are logical completeness and cascadability. A com¬ 
plete set of logic functions must include inversion [24]. Inversion is a basic function of MOSFET electronics and the inverter 
structure forms the basis towards implementing more complex Boolean operations such as multi-input and logically complete 
NOR. Cascadability means that the output of one gate can directly drive the input to the next and allows the direct implemen¬ 
tation of multi-level logic. Logic level restoration and inputs and outputs of the same format are therefore crucial to allowing 
cascadability. 

The most successful logic devices have three orthogonal ports (which can be separate in time, space, wavelength, or polar¬ 
ization) and input-output isolation. A three-terminal device, such as an electronic transistor, ensures that the output of the gate 
depends only on the inputs and the output does not perturb the operation of the gate [24]. This input-output isolation results in 
one-way operation and prevents downstream noise which enters the output port from affecting the operation of the gate. There 
are many examples of two-terminal devices in optics. The problem with two-terminal devices is that they work equally well 
in either direction [25]. The classic examples are the laser [27] (a two-terminal device with gain) and nonlinear Fabry-Perot 
etalon [17] which, when critically biased [28], can switch either through the input or through an unwanted reflection back into 
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the output. 

1.3.3 Optical Devices 

The main additional requirement for an optical logic device is operation independent of the relative phase between the beam or 
pulse of light that induces the switching operation and the one that is switched. Phase-sensitivity is not an issue in electronic 
devices since the phases of the interacting electrons are continuously randomized from phonon collisions [3]. Phase insensitive 
logic operation is critical in optics because of the coherence of laser sources and since phase is difficult to control in complex 
systems due to the small optical wavelength. An optical logic gate can be made insensitive to relative phase by using the 
interaction between optical inputs of orthogonal polarizations or inputs that are spectrally resolved m angle or frequency such 

that there is no linear interference, but there may be nonlinear interference. 

Material nonlinearity is a necessary requirement in order to implement an optical logic gate. The desire for a low loss, 
ultrafast gate opposes the desire for low switching energy which can be achieved via the use of resonant nonlinearity. Because 
of the large loss associated with resonant nonlinearity, most ultrafast optical switching and logic gates utilize a much smaller 
nonresonant nonlinearity. A detailed exploration of the effect of loss on optical soliton logic gates is presented in section 5.3. 

It is also desirable for the nonlinear coupling between the interacting optical fields to depend only on intensity, but this can 
only be guaranteed in certain situations, such as with orthogonal circular polarizations in bulk isotropic media [2,29]. Other 
situation' can reduce phase-dependent nonlinearity by using the accumulating phase difference between orthogonally polarized 
linear eigenmodes of naturally or artificially birefringent media to wash out the phase-dependent nonlinear effects. 


1.4 Digital Optical Switching and Logic Devices 

Some i>l the most promising all-optical devices for ultrafast switching and logic are the nonlinear directional coupler (NLDC), 
optical Kerr gate, nonlinear optical loop mirror (NOLM) or Sagnac gate, and the terahertz optical asymmetric demultiplexor 
(TO.-XDi and its variants. Even though these devices may possess a region of small-signal gain, only the NLDC has saturated 
lesels in the device input-output curve and none intrinsically provide a robust way to obtain large-signal gain. As a result, the 
mam requirements for these devices are those of a single-stage switch, the most important of which is high contrast, which 
rcsuliN from complete switching. 

In man> of these devices, complete switching occurs upon accumulation of a 7X differential nonlinear phase shift. This phase 
I.s between two independent components of the same beam or pulse and is not to be confused with the relative phase between 
control and data as discussed earlier on phase-insensitive operation. The accumulated nonlinear phase shift can be written 
gcncncall) 

= ( 1 - 1 ) 

where //: is the nonlinear Kerr coefficient defined by the total index n = + the free-space wavelength, d is the 

interaction length. A is the electric field amplitude of the light that induces the switching operation, and 5 is a constant factor 
determined b\ the nonlinear interaction. The phase shift is proportional to the nonlinear index, the peak intensity, and the 
interaction length, and can be made large by an increase in any one of these parameters. At the output of the device, this phase 
shift manifests itself as an amplitude change, either as a rotation of the polarization followed by an analyzer or due to coherent 
beam combining at an output coupler. 

Even though silica has a small nonlinear index for example (see Appendix D), low loss fiber can provide very long inter¬ 
action lengths (/ with tight transverse confinement [30). thus making fiber based switching devices very popular [3]. Because 
switching iKcurs within the fiber, the control and data must be of different wavelengths and/or polarizations in order to avoid 
the power loss in coupling in the absence of phase locking and in order to discriminate the switched data from the control at 
the output. As a result, operation independent of the relative phase between data and control is achieved, but group-velocity 
or birefringence walkoff limits the interaction length (the walkoff must be less than the pulse durations) and therefore the data 
rate. These devices are now discussed individually in some detail. 


1.4.1 Nonlinear Directional Coupler 

This nonlinear sw itching device is based on the directional coupler as shown in Figure 1.2. The linear directional coupler relies 
on the coherent interaction between two waveguides placed near each other. The coherent interaction is due to evanescent 
overlap between fields confined within the individual waveguides and results in periodic energy exchange between the guides. 
The presence of nonlinearity frustrates this coherent exchange by detuning the waveguides and can result in intensity-dependent 
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Figure 1.2: Directional coupler. In linear operation, energy is exchanged periodically between guides in the coupling region. 
Nonlinear operation can frustrate this coupling, resulting in controlled switching. 

switching between the output ports. The NLDC was originally proposed [31] as an alternative to bistable switching devices such 
as the nonlinear Fabry-Perot etalon. More recently, it has received much attention, particularly in the capacity of a prototype 
all-optical switching device [32], and for ultrafast optical demultiplexing [33]. 

In the limit of weak coupling between identical guides, the coupled mode equations for the NLDC are 

=ic42+/:/n2l'4i|'^l (1-2) 

oz 

=KA\-hkfn2\A2\^ A2, 
oz 

where K is the linear coupling constant, and A, and A; represent the electric field amplitudes in guides 1 and 2 respectively. 
Assuming no nonlinearity, the amplitude in each guide can be written 

^4i = -K-/4| =e> = niCos(Kc)-f-iisin(Kc) (L4a) 

dz“ 

= => /f->(c) = fl2Cos(ic:)-f/J2sin(ic:). (L4b) 

dz- 

With the boundary conditions/4i (0) = Aq and j 42(0) = 0. the coefficients h\ = 02 = 0, leading to the result 

(c) = Aocos (kc) (L5a) 

y42(:) =/4osin(K;). (1.5b) 

The output intensities at the bar and cross pons are given by 

/,(d) = /oCOS-(Ttr//Lcoh) 

/;(d) = /„sin-(Tu//Lcoh)- 

where the coherence length is defined Z,coh = n/K and is the length over which light is completely coupled from one guide to 
the other and back. Therefore, for a half-beat length coupler with d = Lcoh/2, input into guide 1 emerges at the output from 
guide 2 (cross slate). 

in nonlinear operation, using the nonlinear coupled-mode equations, the fractional intensity in each guide is written [31] 

(1.7b) 


^ V^-^’L/cnij yj’ 

where cn() is a Jacobi elliptic function. As before, this solution assumes that there is no light initially launched into guide 2. 
The critical intensity is defined as 

/ . _ . fl .81 
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Figure 1.3: Operation of half-beat length nonlinear directional coupler. For low power, or linear, operation, the inf3ut pulse exits 
the cross port (guide 2) while in nonlinear operation, the central portions of the pulse, which exceed the critical intensity, exit 
the bar port (guide 1) with the wings exiting the cross port. 

and represents the input intensity for which the light is divided equally between the two output ports [31]. For a half-beat length 
coupler, the critical intensity results in a 7i phase shift for the field in guide 1, and 8 = 1 in equation 1.1. Above the critical 
intensity, the fraction of light exiting the bar port increases towards unity. This two-terminal device performs a switching 
riperation since photons from the data stream are directed into one of two output ports. 

A half-beat length NLDC is shown schematically in Figure 1.3, where a low-power data pulse couples to the cross port while 
a high-power data pulse remains in guide 1 (bar state). The high-power case can be understood intuitively by considering the 
nonbnear increase in the core index which more strongly guides the light, thus detuning the coupler. Notice that only the central 
portion of the high-power pulse exceeds the critical intensity and remains in guide 1. This effect is called partial switching, or 
pulse breakup, and is characteristic of many optical switching devices and results in reduced integrated energy contrast [33]. In 
fact, partial switching is a problem with all of the non-soli ton devices discussed in this section. Solutions to this problem are the 
use of square pulses [34] such that the intensity is constant across the pulse duration, or the use of temporal solitons [35-37], 
w hich propagate unchanged and tend to switch as a unit because of uniform phase across the pulse profile [38]. 

Fiuure 1.4 shows a plot of the switching fraction into the bar output port versus normalized peak pulse intensity launched 
into guide 1 for the half-beat length NLDC. The solid curve shows the fraction of the light that remains in the bar state assuming 
a con.stani intensity square-top pulse as given directly by equation 1.7a. The curve for the soliton case has siniilar shape with 
sharp su itching characteristics [35], but the switching intensity threshold is about twice the critical intensity because the induced 
nonlinear phase is half that of the plane wave case, as shown in section 2.2. The dashed curve shows the light fraction assuming 
a- non-soliion .sech-() input intensity profile and takes into account partial switching which leads to reduced contrast operation 
as indicated by the greatly reduced switching fraction. In either case, at low input levels, all of the light switches over to the 
cross port, while at high inputs, most of the light exits the bar port. 

Direct cxscadability can also be addressed using Figure 1.4, where the dotted line represents the input threshold for which 
the output of the bar port exceeds /cni- Figure 1.5, which shows the transfer function for each port of the device, provides the 
same information more clearly. As shown in the figure, for a square-top pulse, an input intensity greater than 1.1 /„! is required 
to exceed the threshold intensity at the bar port. In this case, the output of the bar port can be used as the input to another 
device resulting in 50*7, switching of that second device. Therefore, much higher inputs than I.l Unx are necessary to cascade 
to another device, but. due to the lack of large-signal gain and in the absence of external amplification, any optical losses in 
the system will eventually result in the degradation of the signal level to the point that switching ceases. The situation is worse 
in the case of a nonuniform pulse. Here, a peak input intensity of greater than 1.6 Icni is required for at least 50% switching 
of a subsequent device. Indeed, because of partial switching, attempts at cascading two such devices have met with limited 
success 139). A final point to note is that when designed as a half-beat-length coupler in linear operation, the NLDC transfer 
function has quasi-saturated levels which attenuate fluctuation in the input level and allow for some noise margin. 

As described so far. the NLDC is a two-terminal device whose switching operation is determined by the intensity of the 
input signals. A three-terminal version can also be constructed by using a high-power control pulse to switch a low-power data 
pulse as shown in Figure 1.6, but this device is not directly cascadable either due to the lack of large-signal gain and inconsistent 
signal representation (using different wavelengths). The control pulse must be of different central frequency or polarization in 
order to avoid phase-sensitivity and to extract the data at the output with zero background. Experimental demonstrations have 
used different frequencies because of the large difference in coherence lengths between orthogonal polarizations [33]. There 
are still partial switching problems if the control pulse does not have constant amplitude. An additional problem is dispersive 
walkoff, which will be discussed in more detail for the Kerr gate. A recent demultiplexing demonstration [33] used a longer 
control pulse length and allowed the data pulse to walk-through the control to achieve complete switching, and therefore, high 
contrast. Lengthening of the control pulse means that the separation between data pulses needs to be larger, thereby reducing 
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Figure 1.4: Switching fraction into the bar port for a half-beat length NLDC versus input peak intensity. The fraction into the 
cross port is inverse of the curves shown. The solid curve is for a square-top pulse while the dashed curve is for a non-soliton 
sech-O intensity profile pulse indicating incomplete switching. The dotted curve shows the threshold for which the bar state 
output exceeds the critical intensity. 


the data rate and increasing the switching energy, but also allows for some tolerance to timing jitter. 

Because of the region of small-signal gain shown in Figure 1.5, it is possible to obtain large-signal gain with this device, 
based on the process of light-induced symmetry breaking (40). as originally studied using a phase-dependent interaction be¬ 
tween light input into each port. The input into port 1 is biased with a “power supply or clock pulse near the critical intensity, 
which is an un.stable operating point. Due to small-signal gain, a weak control input into port 2 is amplified, resulting in 
complete switching into either the bar or cross state depending on the relative phase between the inputs in ports 1 and 2. For 
a multiple of 271 relative phase, the inputs add constructively and the output emerges from the bar state, while for 7t relative 
phase, the inputs add destructively causing the output to emerge from the cross state. This switching is critically dependent 
on the relative phase between the data and control, and therefore cascadability may be difficult to achieve if the phase of the 
output depends on the intensity of the inputs. It also appears that such a device would be sensitive to variation in waveguide 
parameters. 

A phase-independent version of this three-terminal device with gain can be realized using weak amplitude to initiate switch- 
ing into the bar port, thereby providing a controlled inversion operation at the cross port. This device must therefore meet the 
more demanding requirements of a logic gate. Again, it port 1 is biased by a clock pulse at some level less than the critical 
intensity such that the clock exits the cross port, then additional light of orthogonal polarization or color entering port 1 can 
activate switching into the bar port. Therefore, logic level restoration is obtained since only the power supply or clock pulse 
(from the cross port) is passed to subsequent gates when the control pulse is blocked by an analyzer or spectral amplitude filter. 
For instance, referring to Figure 1.5, for bias in port 1 of 0.8 /cnt, then an additional contribution of 0.6 /cnt (for cross-phase 
modulation coefficient of 2/3) can cause complete switching into the bar port. This example illustrates a gain of 1.3, which is the 
maximum for the standard device and may not be achievable in practice because there is strong variation in switching intensity 
with power supply and device variations will not allow for level standardization. It is clear that the larger the small-signal gain, 
the less the input required to switch the state of the device. Because of the ripples in the transfer function, a control pulse of 
intensity 1.2 will allow for an output level of 0.2 at the cross port, thereby significantly reducing the contrast of the gate. When 
the switching is incomplete, though, the small-signal gain may be less than unity, precluding the possibility for large-signal 
gain, as illustrated by the transfer function for the non-soliton sech^O (in intensity) pulse. 
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Figure 1.5: Output intensity into the cross and bar states as a function of input peak intensity the half-beat length NLDC. The 
solid cur\e is for a square-top pulse while the dashed curve is for a non-soliton sech^O pulse. 



Figure 1.6: Three-terminal half-beat length NLDC. The control pulse is of a different color or polarization than the data pulse. 

1.4.2 Kerr Gate 

The optical Kerr gate was proposed as an ultrafast photonic gating device [41]. In the implementations discussed here, the 
Kerr gate is a three-terminal device in which data is either passed or blocked at the output. Since logic-level restoration is 
not possible, this device is simply an optical switch. The bulk geometry is shown in Figure 1.7. By virtue of the difference 
between self-phase modulation and cross-phase modulation, the gate beam A\ induces a differential phase shift which rotates 
the polarization of the signal beam Aj. as long as there is nonzero projection of the polarization of ^2 onto a direction orthogonal 
to the polarization of A 1 . Assuming plane waves interacting at small angles, the total electric field at the input can be written 

£(0) =.f [Ai(0)c“'“^'-l-coseA2(0)c"'‘^'-|-cc] -fy [sin0/12(0)e“'““'-l-cc], (1.9) 

where 0 is the angle that the polarization vector of A 2 makes with the axis, which is the assumed polarization direction of A 1 . 
Note that in practice beams must be used in order to spatially separate A 1 and A 2 at the output, where the angle of interaction 
is greater than the diffraction angle of either beam such that the beams are spatially resolvable in the far field. An additional 
consideration when dealing with beams (or pulses as in the fiber geometry) is that partial switching occurs analogous to the 
situation with the NLDC. For simplicity, the plane-wave analysis is used and is sufficient for the present purposes. 
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Figure 1.7: Bulk Kerr gate geometry. The polarization of the signal (or data) rotates based on the intensity of the gate (or 
control) 1 . In the absence of/4 1 , A 2 is blocked by the analyzer at the output. For A 1 of sufficient intensity, is transmitted by 
the analyzer. 


After a propagation distance d in the nonlinear medium, the polarization components accumulate linear and nonlinear phase 


Ai(<f) =A|(0)e''^V‘*’' (110) 

A2<(r/) =/l2,(0)e'*oV*^ d-H) 

A2y{d)=A2y{0)e“‘°‘‘e‘’^^>, ( 1 - 12 ) 

where the nonlinear phases are written as 

0,=^_^‘'[|A,(c)+A2,(c)P + 2A|A2,(z)|-]Jz (1.13) 

<I),, = ^j^‘^[|A,(c)+Ai,(c)l- + 2A|A;,(;)|']rfc (1-14) 

<t,3, = ^jj'[|A2,{c)|- + 2AiA,(z)+A.,(2)l']^:. (1-15) 

The factor 2A denotes the cross-phase modulation coefficient and is equal to 2/3 in isotropic media. Assuming that |Ai| » 
|A;1. thus precluding the possibility for large-signal gain, and that there is no absorption or power exchange throughout the 
interaction, the nonlinear phase induced on A: can be simplified to 

4),, = ^|A,(0)|-</ (1.16) 

V 

4>,,, = 2A^|A,(0)r-r/, (1.17) 


with the differential change in phase between the polarization components of A 2 written 

Ad) = - 4>;.. = ^ [1 - 2A] |A, (0) \^d. 


(1.18) 


such that 5=1 — 2A. 

In terms of a Jones vector, the output A: before the analyzer can be written 


A2{d) = A2(0)e'*o‘'e'*7v 


cosGe"^* 

sin0 


(1.19) 
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The transmission of a linear polarizer (analyzer) oriented at an angle ()) relative to the x axis is [42] 


7 '(< 1 >) = 


cos"(t> sin(j)cost!) ] 
sin(t)cos(t) sin^tj) 


The output intensity of the signal beam after passing through the analyzer is therefore 


12 (d) = I2(0) 


sin^ (!) COS' (!> + cos^ 6 cos'* (j) + sin^ 0 sin'* tj) 
+2cos9sin9cos<l)sin(l)cosA4> j, 


( 1 . 20 ) 


( 1 . 21 ) 


where the intensity/ 2 ( 0 ) = Eqc^o |^ 2 ( 0 )r/ 2 . 

Typically, <|) = 0 ± 7i/2 so that complete extinction is obtained with no nonlinear phase shift, thus implementing a pass gate. 

In this case, the output intensity is , nnx 

/ 2 (^) = 2 / 2 ( 0 ) cos^e sin-e[l-cos Ad>]. (1-22) 

In order to maximize the contrast of the device, 0 = 7i/4, leading to the final result 

/.(t/) = /2(0)sin"(A<D/2). (1-23) 

Thcreli>re. a minimum of a 7i phase shift is again needed for complete switching, but over switching results when AO > 7i. A 
contrast ratio can be defined based on the difference between the high and low output states of a single port 

— ^2(^)llow Q 24) 

/;(^)lhigh + /2(^)liow' 

In the case of the pass gate, p = 1, assuming an analyzer with perfect extinction. 

An inverter can also be realized by setting ([) = 0. The output intensity is 

/ 2 (^f) = /:(0) { 1 -2cos-9sin‘9[l-cosA<I>]}. (1.25) 

Again. 0 = t: 4 maximizes the contrast, leading to the result 

/ 2 (d) = /;(9) [1 - sin- (A<I>/2)], (1.26) 


with A<I> = :: for complete extinction of the signal. The contrast ratio of the Kerr gate inverter is 


sin-(A<b/2) 

2-sin*(A<t)/2) 


0< p< 1. 


(1.27) 


Now ihc contrast is limited by the precision to which the intensity of the gate beam can be controlled in order to exactly achieve 
the condition A4> = n for complete extinction of the signal. 

Figure l .S shows the input-output relationships for the pass gate and inverter geometries. The maximum small-signal gain 
occurs when AO = (2m + 1) n/2, but. since /| » /;, small-signal gain is always much less than unity. In addition, there are 
quasi-saturated levels ftir AO = mil. where m is a positive integer. The most important point to note is that over switching can 
occur. As AO increases towards ti. the pass gate switches on and the inverter switches off, but as AO increases past 7t, the pass 
gale switches off and the inverter switches on. Therefore, there are no true saturated levels giving rise to stable states insensitive 
to variations in the gate beam intensity. 

Because of the condition /[(O) » /:(</). large-signal gain is not intrinsically possible with the Kerr gate either. In other 
words, it takes a high intensity gating beam to switch a small intensity signal beam. External amplification can be used to bring 
the high output state after the analyzer to the level required for switching a subsequent stage, but because of the over-switching 
problems inherent in the device, this process may not stable and could eventually lead to attenuation of the signal level after 
a long cascade of devices due to unavoidable amplitude fluctuations. This process can be made stable through the use of a 
saturating amplifier, but the gain recovery time of the amplifier will place a limit on the data rate. 

As an example of the switching intensities needed for the Kerr gate, consider a 1 cm thick sample of fused silica, with 
/i( = 3.3 X 10”'^ cm-/W at X/ = 1.55 pm (see Appendix D). In this situation, a gate intensity of /i(0) = 700 GW/cm^ is 
required. This intensity can be lowered by orders of magnitude by going to a fiber geometry in^which the interaction length can 
be extended to many km’s, as shown in Figure 1.9. Fora 1 km fiber gate, /i(0) = 3.5 MW/cm". Another advantage of the fiber 
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Figure 1.8; Transfer functions for the Kerr gate. The solid curve shows the normalized output versus induced phase shift A<I> 
for the pass gate geometry, while the dashed curve shows the same information for the inverter. 


analyzer 



Figure 1.9: Fiber Kerr gate geometry. The gate and data pulses are of different central wavelengths. The gate pulse is timed 
to co-propagate with and extract one data pulse. Wavelength-division multiplexing (WDM) couplers launch the gate pulse into 
the interaction region and extract the gate pulse at the output for background-free operation. The bulk analyzer shown can be 
replaced by an in-fiber polarizer. 

geometry in communications situations is that there is no need to couple the signal, or data, pulses out of and back into the fiber 
in order to achieve the desired switching operation. 

Since the pump and data pulses must be separated at the output for background-free operation (i.e. high contrast), the two 
pulses must be of different color. An additional constraint is that the fiber must be polarization preserving, or birefringent, in 
order to keep the polariMtion of the data pulses from randomly evolving due to thermal or stress variations. As a result, the 
polarization components of the data pulses pick up a nonzero relative phase due to the intrinsic birefringence 

Ai,{d)=A2x{0)e‘'‘‘‘‘e‘'^^ ( 1 - 28 ) 

A;v(^/)=A2,v(0)e'Ve'‘t>2v_ (1-29) 

where /.x = litn.jKf, ky = Imiy/Xj and /i., and /i, are the indices of refraction along the principal axes of the fiber. The final 
polarization state without the gate pulse is in general elliptical, but the original state of polarization can be recovered using a 
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Figure 1.10: Pulse walkoff in fiber Kerr gate geometry. The fiber length is denoted by d, but the interaction length is limited 
to'he birefringent or dispersive walkoff distance L This diagram illustrates the dispersive walkoff between the gate and same- 
polarized data. The birefringence walkoff between the two components of the data pulse is similar. 


compensator plate. Now, the nonlinear phases are written 

4.2, = 2?^|/'i(0)|V (1-30) 

<t);v = 2A^lAi(0)|'rf, (1-31) 

where the factor of 2 in the x phase is due to the interaction between waves of different frequency, and the four-wave mixing 
terms have been neglected. The differential phase is 

(1.32) 

which, w hen A = 1 /3, is a factor of 2 larger than in the bulk geometry due to the increased cross-phase modulation coefficient 
between different colors in the same polarization state. 

In an early Kerr gate demultiplexing experiment in fiber [43], it was recognized that birefringent and dispersive walkoff 
would limit the performance of the device due to the long interaction lengths needed to obtain a n phase shift with low switching 
energies. The result of walkoff is the reduction of the interaction distance as given by the length of the fiber, as shown in 
Figure 1.10. to an effective interaction distance given by one of the walkoff lengths. If the gate and data pulses have temporal 
duration T, then the w alkoff lengths can be defined 


T 

T _ 


(1-33) 

(1.34) 


where A'(u)j) and k[((iij) are the group delay coefficients for the polarization components at the data wavelength, is the 

group delay coefficient at the gate wavelength. Aco ~ 1 /t is the approximate spectral separation (by one FWHM), and k"{(nj) 
is the group delav dispersion coefficient. For T = 1 ps pulses (for a data rate ~ 100 Gb/sec with pulse separation 10 x), typical 
walkoff lengths in silica fiber are Lbire = 30 m for |njt — riyl ~ 10”^ ^disp = 35 m, using k!^(<)id) = —2.79 x 10 * ps /pm 
from Appendix D. 

The birefringence walkoff length is the distance over which the two polarization components of the data pulse maintain 
overlap by at least their temporal FWHM. This is also the distance over which the gate pulse and the orthogonal polarization 
of the signal pulse maintain overlap. The dispersive walkoff length is the distance over which the gate pulse and the same 
polarization of the signal pulse maintain overlap. In the worst-case scenario, the gate pulse propagates down the fast (slow) 
axis of the fiber and has the greater (lesser) group velocity. The interaction distance is therefore limited to the minimum of 
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Figure 1.11; Diagram of the two-terminal nonlinear optical loop mirror in the intensity-dependent switching mode. 

the birefringent and dispersive walkoff distances. The best-case is for the gate to propagate down the fast (slow) axis and 
have the lesser (greater) group velocity. Here, the interaction distance is limited to the birefringent walkoff length between the 
polarization components of the data pulse. 

The effective interaction distance can be increased to twice the walkoff length by launching the slower pulse one pulse 
duration ahead of the faster pulse, but this technique still does not solve the underlying problem. Birefringent walkoff can be 
effectively eliminated by cross-axis splicing the polarization maintaining fiber such that the distance between each splice is 
less than the walkoff length Lbire [43]. The effects of dispersive walkoff can be reduced by decreasing the spectral separation 
between the data and gating pulses, thus limiting the data rate because l/x ~ A( 0 , or by equally spacing the central wavelengths 
about the zero dispersion wavelength such that both pulses travel with the same group velocities. This does not allow for both 
pulses to be temporal solitons, however. 

A recent fiber Kerr gate demonstration used a 1.5 cm length of silica fiber (which is about Ldisp for the experimental pulse 
duration) to perform demultiplexing of a 460 Gb/s data stream [44]. In the experiment, the peak gate intensity 43 GW/cm- 
(about 5 nJ pulse energy) resulted in a gating efficiency of less than I07r. A n phase change requires peak gate pulse intensity of 
nearly 235 GW/cm- (about 25 nJ pulse energy), which, except for the benefits of compactness and increased coupling efficiency, 
eliminates the original advantages of going to the fiber geometry. 

1.4.3 Nonlinear Optical Loop Mirror 

The nonlinear optical loop mirror [38], or NOLM, has been used extensively in ultrafast all-optical demultiplexing experi¬ 
ments [45,46]. Studies have also focused on the use of NOLMs to realize simple logic gates [45] and more complicated logic 
circuits [ 16). The most basic two-terminal configuration is shown in Figure 1.11. A pulse which enters port 1 is split at a 2x2 
fiber coupler into two counter-propagating pulses which recombine at the coupler. For a perfectly symmetric device, the pulses 
constructively interfere at the coupler and exit from port 1, thus acting as a mirror. Deviations frorn symmetry result in light 
exiting both ports 1 and 2. In nonlinear operation of a non-symmetric device, the pulse intensity can control the ratio of light 
exiting each output port. 

Immediately after the coupler, the field amplitude in each counter-propagating direction of the loop can be written 

Ai = tA\ A 3 = (1-35) 

A 4 = i>Ai A 4 = A 3 e''''’--*' (1-36) 

when it is assumed that no light enters port 2. The unprimed quantities represent the amplitudes before propagation around the 
loop while the primed quantities are after propagation around the loop and just before the coupler. The phases around the loop 
are allowed to be different as Justified later. Here t is the (real) amplitude transmission coefficient and r is the (real) amplitude 
reflection coefficient. When the angle of incidence is less than the critical angle (as it must be here), there is a it phase shift upon 
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reflection from one of the coupler interfaces. Since the 3dB fiber coupler is symmetric, this phase is distributed symmetrically 
with the reflection coefficient in each direction. The amplitudes at the output ports are then 

A'i =rA3 + /M4 = i/M + (1-37) 

A '2 = tA'i + irA'2=Ai , (1 - 38 ) 

where AO = (()cw - <t)ccw- Finally, the output intensities are written 

/;=2r2[l-r][l+cosA<D]/i=4/ir2[l-r2]cos2(AO/2) (1.39) 

/; = /i-4/ir[l-r-]cos-(AO/2), (1-40) 

assuming a lossless system such that + r* = 1. 

For the original NOLM device which switches based on the intensity of the incoming data pulses, the phases are 


<t>ccw —— 4 — [ lA 4 (z)|”dz fti 

A/ Jo 

<l>cw = ^f'4 -r f |A3(c)|‘'£lz W ^i.+ 

Af Jo 


A/ 

^1.43(0)1^ 

Af 


(1.41) 

(1.42) 


where the differential phase is. written 


AO = 


IniizL 


I^3(0)|--|A4(0)|-]‘ 


2 JI/ 12 L 

h 


l- 2 r^] |Ai( 0 )|*, 


(1.43) 


and 6=1- 2r-. For a symmetric device in which r- = 0.5. the phase shift is zero and all the light exits port 1, independent 
from the strength of the nonlinear interaction. Therefore, the loop must be unbalanced in order for self-switching to occur, in 
which the light exits port 2. 

An example of the self-switching of the NOLM is shown in Figure 1.12, which plots the peak intensity in each of the two 
output ports (assuming square-top pulses) using = 0.25. This figure is different than Figure 1.8 of the Kerr gate, which plots 
the switching fraction, not the output intensity. This is done here to emphasize the low contrast of the two-terminal NOLM. 
Like the Kerr gate, over switching can occur for input intensity beyond that required for AO = n, thus making this device 
difficult to cascade. It should also be noted that partial switching will occur if square-top or soliton pulses are not used. 

Because the self-switching behavior is similar to the two-terminal NLDC, a three-terminal NOLM with large-signal gain 
can be con.structed in the same way, by biasing the device with a strong power supply or clock pulse and using a weaker 
control or data pulse (of different polarization or color) to initiate switching, thereby providing complete logic level restoration. 
Unlike the NLDC. the NOLM switching function does not possess saturated levels, thus making the device sensitive to input 
fluctuations, but also allowing for multiple intensity levels for the clock pulse (spaced by even intervals on the figure) to achieve 
inverting operation. Because the switching intensity is fixed at 1/2A for the coordinate system used in the figure, the large- 
signal gam can be arbitrarily high (subject to limitations placed by nonlinear absorption, for example). The output from either 
port 1 or port 2 can be u.sed. The contrast from port 1 is unity, but since this is also the input port, a coupling loss must be taken 
both at the input and at the output in order to extract the switched data. 

The contrast of the NOLM at output port 2 is 


P = 


2/^(1 

\-2r^l-r'-Y 


(1.44) 


which is maximized when r = 0.5. The peak intensity required for a n phase shift is 


/ _ 

'' ~ 2/ia[l-2r2]’ 


(1.45) 


which is infinite when = 0.5. Here, the intensity /| represents the data pulse intensity plus any additional control pulse 
intensity. For example, /j = /daia + 2A/comroi. in the case of the biased device. The minimum required intensity is when r= 0, 
but the contrast is 0. Therefore, in the two-terminal or biased three-terminal NOLM, there is a tradeoff between high contrast 
operation and low switching intensity (energy). 

Some of these problems can be solved by using a balanced ir^=0.5) and un-biased three-terminal NOLM [47]. This 
configuration is shown in Figure 1.13, in which an external control pulse of different polarization or wavelength is used to 
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Figure 1.12: Switching characteristics of the two-terminal NOLM. If a non-square-top or non-soliton pulse is used, the output 
enersy will not follow the intensity curve and the energy contrast will be reduced over the ideal case of unity for port 1 or 
as gh’en by equation 1.44 for port 2. The amplitude reflection coefficient of the 2x2 fiber coupler is set to r = 0.5, giving a 
reflectance of = 0.25. 


switch a much weaker data pulse by inducing a n phase shift on the counter-clockwise pulse via cross-phase modulation. In 
this case, the phases of the counter-propagating pulses can be written 

«ecw=itoi- + 2A^l4c„(0)|' (1.46) 

Xf 

<J)cw — I:oA. 

where ,4cii is the amplitude of the control pulse, which co-propagates 4ccw and therefore counter-propagates /tew, inducing 
negligible nonlinear effect on the latter. For orthogonal polarizations, 2A = 2/3, while for different wavelengths, 2A = 2. In 
this device, the sw itching intensity and contrast are independent of r, because the change in phase is 

A«I> = 2A^K.,(0)1- (1.47) 

and the contrast at output port 2 is p = 1 assuming that the control pulse intensity can be maintained to achieve AO = 7t. 

Due to the long interactions lengths of 10 km or more [46], these switching elements suffer from polarization or dispersive 
pulse walkoff (unlike the fiber Kerr gate, the control pulse does not have to be of different polarization and wavelength). 
Walkoff can be minimized by using cross-spliced polarization maintaining fiber or choosing the control and data wavelengths 
symmetrically about the zero dispersion wavelength [45]. as discussed for the fiber Kerr gate. Recent experiments have achieved 
demultiplexing of 100 Gbit/s data streams [48] in a 6 km loop using 1 pJ control pulses, and two-stage cascadability using low 
birefringence fiber loops with an amplification stage in between [49]. 

1.4.4 Terahertz Optical Asymmetric Demultiplexor 

The terahertz optical asymmetric demultiplexor [50], or TOAD, was developed to reduce the long latency inherent in the NOLM 
and reduce switching energies by asymmetrically positioning a thin, highly (resonant) nonlinear element on one side of center 
of a fiber loop mirror, as shown in Figure 1.14. The fiber nonlinearity is not utilized and therefore the fiber loop can be on the 
order of meters in length rather than kilometers. 
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Fiiiurc 1.13: Three-terminal NOLM in two-pulse configuration. Wavelength division multiplexing (WDM) or polarizing (PBS) 
couplers transfer the control pulse into and out of the loop with nearly unity efficiency. For polarizing couplers, the fiber loop 
must be polarization maintaining. The control pulse only propagates counter-clockwise, and only interacts strongly with the 
counicr-clockwise component of the data pulse. 



Fiijure i.!4: Three-terminal terahertz optical asymmetric demultiplexor. The nonlinear phase shift is provided by the control 
pulse excitation of an asymmetrically placed nonlinear element. 


The TOAD switch works similarly to the three-terminal NOLM by imposing a n phase shift in one arm (direction) of the 
loop, and cannot be configured as a true logic gate. The differentiating factor from the NOLM is that a thin nonlinear element 
excited by the control pulse is responsible for the nonlinear phase shift induced on the data pulses. In the diagram, pulses 
1-3 pass through the nonlinear element before the control pulse and experience no nonlinear phase shift in the clockwise and 
counter-clockwise parts because the nonlinear element is in the un-excited state. As a result, there is no differential phase 
shift (AO = 0) and the pulses exit the input port (or reflect). Pulses 5-8 pass through the element after the control pulse, but 
now each direction experiences a nonlinear phase shift, but, given constraints of long relaxation time of the nonlinear element 
as discussed later, experience no significant differential phase shift and also exit the input port. Pulse 4, on the other hand, 
switches out the other port because the clockwise propagating part of the pulse passes through the nonlinear element before the 
control pulse and experiences no nonlinear phase shift, but the counter-clockwise part reaches the nonlinear element just after 
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excitation by the control pulse and experiences a nonlinear phase shift. If the differential phase shift A4> — n, then the pulse 
will exit the output port. 

The counter-clockwise and clockwise phases can be written 

<t*ccw 
<l>cw 

where /, is the time that the clockwise portion of the /''' pulses reaches the nonlinear element, r, H-2Av/v^ is the time that the 
counter-clockwise portion of the same pulse reaches the nonlinear element, roi is the time that the control pulse excites the 
nonlinearity, and is the group velocity. The time-dependent nonlinear index change is written 

A/i(r)= / /?(T)K,i(f-T)|-<fT f>0 (1-50) 

= 0° /<0, (1-51) 

where R{x) is a nonlinear response function, typically of the form of an exponential decay. It is clear that, assuming an 
instantaneous rise time. 

ii<Uu<ti + 2Ax/vg, (1-52) 

in order to switch the pulse. A much more complete analysis has been performed [51], with the basic conclusion that the 
nonzero length of the nonlinear element places a restriction on the minimum duration of the switching window, which must be 
at least twice the transit time through the element plus the rise time of the nonlinearity. The relaxation time of the nonlinearity 
.sets the lower bound on the frame time because the nonlinear material must be in the ground state before re-excitation by 
another control pulse. These requirements are expressed by [50] 

trise Ibil Tfjyrne. (1-53) 

where Tnsc is the rise-time of the nonlinearity and can usually be considered instantaneous, tir-jnsit the transit time through 
the nonlinear element, 7b„ is the time between adjacent bits in the data stream, Xfati is the fall time of the nonlinearity and does 
not have to be instantaneous as in the other devices which require Tfan Tbit, ^nd Tframe ts the frame time for time-division 
multiplexed data. The fall time Tfaii must be greater than the time difference 2A.x/vg between when the two parts of the same 
pulse pa.ss through the nonlinear element so that the clockwise and counter-clockwise portions accumulate approximately the 
same nonlinear pha.se shift. A disadvantage is that only one data pulse can be switched within the relaxation time of the 
nonlinear element, so that the switching events must occur at least Tfaii apart. Unlike the other devices, the TOAD is relatively 
insensitive to timing jitter between the control and data puKses. All that is required is that the control pulse arrives (up to one 
bit time) at the nonlinear element before the intended data pulse. The amount of timing jitter that can be tolerated is limited to 

Thji “ Tnst* 1 •• 

Because ihe nonlinearity need not be instantaneous (and m fact the fall time must be long), very large, resonant nonlinearity, 
such as that produced by real particle excitation and decay, can be used. The optical path length of the nonlinear element must 
be less than that between the element and center and, since the nonlinear phase shift is proportional to the product of nonlinear 
index, control pulse intensity, and the thickness of the nonlinear element, the latter is perhaps the main limitation on high 
contrast, low energy switching for THz data streams. A recent demonstration [52] showed single-pulse demultiplexing of 
250 Gbit/s data with 0.8 pJ switching energy. 

A related device, based on a Mach-Zehnder configuration with asymmetric placement of a nonlinear element in each of the 
two arms (where the difference between the distance from the coupler to each element is Ar), has also been demonstrated [53]. 
As in the TOAD device, a control pulse is used to excite the nonlinear elements, where the time difference between the excitation 
of each element leads to a switching window of duration Av/i j,, but again, large-signal gain cannot be realized. Because the 
two arms are physically distinct, any differential environmental variations can disturb the operation of the gate, but the effect is 
less important for an integrated device. 

Another variation, which perhaps is more akin to the fiber Kerr gate, is the so-called ultrafast nonlinear interferometer [54], 
or UNI. This device is shown in Figure 1.15. The input data pulses split equally along the polarization axes of birefringent fiber. 
Because the group velocity is different along each axis, the polarization components walk-off from each other in time. The 
lime difference between the leading and trailing components allows for asymmetric excitation of the nonlinear element by a 
strong gate pulse such that the leading pulse is affected only by the amplification process of the device (the device is electrically 
pumped) and the trailing pulse is affected by the device after gain saturation by the strong gate. Therefore, due to sub-ps carrier 


= + A—AM(r/ - rcii + 2Ax/v^) (1-48) 
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Figure 1.15; Three-terminal ultrafast nonlinear interferometer. A differential phase shift is imparted onto the data pulse at the 
semiconductor optical amplifier (SOA) by asymmetric excitation with the gate pulse. 

heating and gain depletion, and virtual electronic processes, both differential amplitude and phase changes are induced between 
the data components. Long-lived carrier population and thermal nonlinearities ensure that no differential change is induced 
between subsequent data pulse components. After traveling through another length of birefringent fiber with fast and slow axes 
re% ersed. the components overlap at an analyzer biased to produce the desired logic operation. For example, if the analyzer 
IS set to pass the data pulse in the absence of the gate pulse, then the device will act as an inverter, albeit without logic level 
restoration and large-signal gain. 

Like the TOAD, the switching rate is limited by the gain recovery time of the SOA, but it should be noted that only enough 
recovery time is needed so that the control pulse can induce another n phase shift. If only TDM de-multiplexing is performed, 
then recovery time only limits the frame period, while the bit rate is limited by the optical thickness of the element and the rise 
time of the nonlinearity. For switching operations that occur on a bit-by-bit basis, then the n phase shift recovery time places 
the mam limitation on bit rate, which can be as high as 100 Gb/s [55]. A recent experiment [11] demonstrated the cascadability 
of this three-terminal device by implementing a 40 Gb/s inverting shift register. A fiber amplifier was used to bring the inverted 
data output of the device up to the necessary 0.1 pJ switching energy. 

1.5 Optical Soliton Logic Devices 

The devices discussed in the previous section all suffer from drawbacks that may limit their use in cascaded digital logic 
and switching systems. Except for variations of the nonlinear directional coupler (NLDC) and nonlinear optical loop mirror 
(NOLM). none of these switching devices has complete logic level restoration with large-signal gain. In the standard config¬ 
urations. the data pulse itself is transmitted to the output. The strong control pulse only initiates the switching and is thrown 
away afterwards As a result, the transmitted data is never restored allowing signal level variations to build up over time. The 
roles of control and data can be reversed such that weak “power supply” pulses are passed on to subsequent gates, and are 
controlled by strong data pulses. Here, restoration is almost complete, with the lack of obtaining a valid signal level without 
external amplification which introduces ASE noise and bandwidth limitations. The biased three-terminal NLDC and NOLM 
have complete logic level restoration because of large-signal gain, but the gain of the NLDC is very limited and suffers from 
low contrast m practice, and the NOLM does not possess saturated levels and is lossy in order to achieve high contrast. 

In the three-terminal configurations, these devices are only directly cascadable when using control or gate pulses of or¬ 
thogonal ptilarization to the data pulses. Cascadability is achieved simply by using a half-wave plate to rotate the polarization 
of the output to serve as a control input. Using different colors also allows for cascadability, but the system becomes more 
complicated. In the first realization, the output of the gate must be converted through some nonlinear process such as three- or 
four-w'ave mixing, to the appropriate control input color. A second realization is to alternate the correct control color at each 
stage such that no wavelength conversion is necessary, but this may require two device designs in order to achieve optimum 
performance. These two-color schemes are the only way to cascade the fiber Kerr gate, since different colors are necessary in 
the fiber geometry, and the NLDC, because the use of different polarizations greatly reduces the switching performance [33]. 

The Kerr gate. NOLM, and terahertz optical asymmetric demultiplexor (TOAD) all suffer from over switching, although 
saturated gain in the nonlinear element can clamp the total induced phase shift in the latter device. In each of these three 
devices, a control intensity higher than that required for a n phase shift will cause over switching [45] and result in reduced 
contrast. Therefore, each gate in the system must be precisely toleranced if high contrast operation is required, but because of 
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unavoidable variations, the signal level will be attenuated through a long cascade with eventual loss of the logic level. 

The performance of all of the devices, except TOAD, is reduced considerably by walkoff and incomplete switching. Means 
of reducing walkoff were discussed for each device, but it should be noted that, since the use of orthogonal polarizations is 
more practical for cascadability considerations, cross-axis splicing is an elegant solution. Because square-top pulses possess 
high frequency components at the leading and trailing edges leading to rapid pulse broadening, the most robust way to eliminate 
incomplete switching is to use temporal solitons for the control and data pulses. The use of temporal solitons can also reduce 

or eliminate walkoff by the mechanism of temporal trapping [56,57]. 

The limitations of these devices motivates the study of a new class of devices based on optical solitons, which make full 
use of the soliton properties. Because solitons are threshold phenomena, they are natural carriers of digital information. Below 
a critical power or energy, a nonlinear wave will asymptotically disperse, but above the critical power or energy, a nonlinear 
wave will evolve into one or more stable (possibly higher-order) solitons. Therefore, binary logic levels can be represented by 
the amount of detected power or energy, or even by spatial or temporal size (a soliton could readily pass through a spatial or 
temporal aperture, while the majority of a dispersive wave would not). A related issue is resolvability. Since solitons beat the 
linear diffraction/dispersion limits, a logic gate (without absorption) could be arbitrarily long. This property is used to great 
advantage in the soliton collision and dragging gates as discussed later, where a small angular/frequency change (induced by a 
nonlinear phase shift less than n) manifests itself as a resolvable spatial/temporal shift. 

Other benefits arise from the use of solitons in digital logic systems. Solitons are stable to weak perturbations meaning 
that the soliton tends to maintain its shape in the presence of material inhomogeneities or input profile variations, which is an 
important property for logic restoration. In fact, the original amplitude and shape can be restored even after absorption by the 
use of adiabatic amplification. This property is useful for logic restoration and in long haul communications. Additionally, in 
some cases, solitons will maintain their shape even after collision with another soliton, which turns out to be a disadvantage 
for optical switching. The logic gates presented in this thesis use a configuration in which the interaction is inelastic such that 
soliton shape is not necessarily maintained after collision. The most important benefit to optical switching is that solitons (or 
nonlinear waves in general) can exert a force on one another, unlike light in linear propagation. These forces can alter the 
direction in space or velocity in time of one or both solitons resulting in a switching or logical operation. 

In order to achieve low switching energy per gate operation, the ideal optical soliton logic gate should be based on one 
of the three geometries that allow for complete three-dimensional confinement [2]: 1-D temporal solitons in fiber, 2-D spatio- 
temporal solitary waves in slab waveguides, and 3-D light bullets. The advantage of spatio-temporal solitary waves over 
temporal solitons for switching and logic is the removal of one or two transverse dimensions of linear confinement. This allows 
for spatial parallelism (without the loss of ultrafast temporal pipelining) and the freedom to use spatial interactions with much 
easier output state discrimination [2,58] than their temporal counterparts [59] in order to implement amplitude keyed logic for 
eventual conversion to the electronic domain. Even though one dimension of spatial parallelism is lost, the 2-D solitary wave 
has the important advantages over the 3-D “light bullet" case in that it should be more easily realizable experimentally [60], and 
the slab waveguide geometry allows for the possibility for tailoring of the dispersion properties [61], photolithographic circuit 
definition, and crxiling through the area of the substrate. The (2+l)-D case also presents an efficient test bed allowing for the 
study of the effects of higher-order terms in the evolution equation by retaining full spatio-temporal dynamics without resorting 
to the time-consuming (3+1 )-D simulation. 

The remainder of this section briefly discusses two soliton-based logic gates based on the trapping interaction. These gates 
satisfy the requirements for a logic device, including large signal gain and logic level restoration. The first device is the temporal 
soliton dragging gate [62], and the second is the related spatial dragging gate [58], which is the main subject of this thesis. Note 
that other soliton interactions can be used for logic devices and are discussed further in Chapter 5. 


1.5.1 Temporal Soliton Dragging Gate 

The soliton trapping [62] and dragging [59] gates are based on the temporal trapping mechanism [56,57]. Two nonlinear 
waves of different polarization and/or wavelength propagate down a fiber with different group velocities. If they are initially 
overlapping in time, in linear propagation, one will reach the end of the fiber before the other. In nonlinear propagation though, 
they can trap each other through nonlinear attractive forces such that they both propagate at the same group velocity. In this 
case, each wave must experience a frequency shift in order to propagate at the common, weighted-mean, group velocity. The 
two nonlinear waves do not need to be of the same size (in terms of pulse area) in order for trapping to occur [57], and the 
frequency shift experienced by the smaller wave is greater than that experienced by the larger wave. 

Figure 1.16 shows the generic three-terminal gate geometry. In the most basic geometry as shown, these gates perform an 
inversion operation in which the pump is passed in the absence of the signal and blocked by a spectral filter or time gate (as 
discussed later) in the presence of the signal. Note that the signal is always blocked at the output and the pump of one gate 
becomes the signal of the next, resulting in true three-terminal operation. Since the pump pulse may be passed on to later gates. 
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Figure 1.16; Temporal soliton trapping/dragging gate. Soliton pump pulses (always present) initially overlap in time wiA (non- 
soliton) data or signal pulses of different wavelength and/or polarization. Cross-induced chirp causes the pump and signal to 
propagate with the same group velocity. This frequency shift must be resolvable for the trapping gate. The change in group 
velocity becomes an arbitrarily large timing shift after propagation through a dispersive fiber delay line. For the dragging gate, 
this liming shift must be resolvable. 


it IS convenient for it to be a temporal soliton in order to maintain its shape after propagation through many tens to hundreds of 
meters of fiber, thereby ensuring logic level restoration. The trapping and dragging gates use pulses of the same wavelength but 
with orthogonal polarizations. Note that similar type gates can be constructed using pulses of different wavelength [63], based 
on the analogous trapping mechanism [64], but this gate is not readily cascadable because frequency shifters are necessary 
to drive the subsequent gate, while in the case of different polarizations, a wave plate can be used to rotate the output pump 
polarization to the correct state. In either case, the gate operation is independent of the relative phase between the pump and 

signal . -•11 

The soliton trapping gate relies on a resolvable spectral shift of the pump soliton [62], so that an unshifted pump will pass 

throuch a spectral bandpass filter while a shifted pump (in the presence of the signal) will not. This results in amplitude keyed 
louic and is compatible with common high-speed optical detectors for eventual conversion into an electronic signal. In order for 
th^s to occur, the fiber birefringence must be sufficiently large to produce the necessary difference in group velocity between the 
pulse propagating down the slow axis and the pulse propagating down the fast axis, which when compensated due to trapping, 
results in a spectrally resolved shift. The pump and signal pulses must also be of nearly the same amplitude so that the shift is 
not weighted preferentially towards the signal. As a result, this gate cannot provide significant large-signal gain. In fact, the 
first experimental demonstrations [62,65] used solitons of 300 fs duration and 42 pJ energies for the pump and signal. A final 
note is that, because complete trapping can occur in a few soliton periods, the gate length need only be a few tens of meters in 
length, thereby reducing latency. 

The temporal soliton dragging logic gate utilizes the fiber dispersive delay line as a “lever-arm” in order to allow a weak 
control or signal soliton to "drag” a strong pump in time [3]. In this way, even a small spectral shift of the pump (i.e. less 
than a n pha.se shift induced by a weak signal pulse) can result in a large time delay due to non-zero group delay dispersion. 
Because the pump is a temporal soliton and does not broaden in time, a resolvable temporal shift can be achieved by choosing 

the appropriate length delay line, according to the expression 

Ao)/:^ L > TpwHM. (^ 

where the frequency shift Am is a function of the pump and signal pulses and the birefringence, and Ic'qL, is the group-delay 
dispersion. Therefore, there is a tradeoff between gain and fiber birefringence (which determine the amount of spectral shift) and 
gate length. If the pump were a linearly dispersive wave, broadening occurs at the same rate as the temporal delay (neglecting 

higher-order dispersion) and temporal resolvability can only be achieved by a resolvable spectral shift. 

" Temporal dragging logic must be time-shift keyed, which is still fully cascadable in the optical domain (but may limit bit 
rate somewhat) for a clocked system, but not compatible with high-speed electronic detection. Detection can be achieved by 
using a trapping gate in the last stage [3] or an ultrafast optical gating mechanism, such as the Kerr gate. One problem with 
time-shift keyed logic is that the dragged pump is difficult to remove from the system (although it is blocked by the polarization 
beam splitter at the output of every second dragging gate) and may cause interference in subsequent switching stages. Note that, 
in contrast to the problems imposed by birefringent walkoff on the previous fiber gates, the temporal soliton trapping/dragging 


* when neglecting the nonlinear four-wave mixing effects 
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Figure 1.17: Temporal soliton dragging NOR gate geometries. Diagram a) shows a two-input soliton dragging gate which 
implements NOR. Diagram b) show two cascaded single-input soliton dragging gates which also implements NOR. 


gates use birefringent walkoff (in combination with nonlinear cross-phase modulation) to induce the switching operation. The 
only constraint is that the walkoff length should be greater than about one soliton period because the bulk of the interaction 
occurs within that distance [66]. 

The finst demonstration [59] of temporal soliton dragging was of a two-input, logically complete NOR gate. Two possible 
realizations of this logic gate are shown in Figure 1.17, where the first geometry is the one used in the experiment. In this 
experiment, the 3(X) fs pump pulse had energy of 132 pJ. while the non-soliton control pulses had 30 pJ energy, realizing a 
gain of 4.5 with high contrast. The fiber length was 75 meters, giving a gate latency of 360 ns. A cascading demonstration 
was al.so performed using a geometry similar to that shown in Figure 1.17 a). This demonstration suffered from the effects of 
Raman amplification, in which lower spectral components of a pulse are amplified at the expense of higher spectral components, 
resulting in asymmetric temporal broadening of the pump pulse, and may limit cascadability due to the shift in mean frequency. 
When both control pulses are present, the output of the device is sensitive to the relative phase between the control inputs. 

Using the geometry of b). a follow-on work [66] obtained switching with reduced control pulse energy and no phase 
sensitivity. In this demonstration, a 500 fs, 54 pJ pump pulse was used along with 6 pJ control pulses. The pulse durations 
were lengthened and energies were lowered to reduce the effects of Raman gain. For this geometry, the length of fiber between 
the coupling regiirn for each control pulse need only be that required for the interaction to occur (about 75 meters in this case). 
The long dispersive delay line (350 meters) can be placed at the end in order to achieve the timing shift due to the spectral 
shift induced by either control. This is one instantiation of the more general time-domain chirp architecture [3]. Dragging was 
obtained with lower control pulse energies because each control pulse completely overlapped the pump pulse in time, resulting 
in a stronger interaction than in the partial overlap geometry of a). 

Subsequent extensions to the time-domain chirp architecture used 30 m of moderately birefringent fiber in which the spectral 
shift occurred, followed by 2 km of polarization maintaining fiber as the dispersive delay line to obtain the timing shift [3], In 
this case, the control pulse energy was about 1 pJ. Another modification used a short ('^ 2 mm), highly nonlinear semiconductor 
as the nonlinear chirping element, followed by 600 m of fiber. Control energies were on the order of 100 pJ because the control 
and pump were slightly offset in time to achieve spectral shift upon trapping due to the lack of strong group velocity mismatch 
in the semiconductor. 

These trapping and dragging interactions are described by the coupled, temporal NLS equations [67] 


+ 2/157 [|Ai I-+2AIA2P] A, = 0 


( 1 . 55 a) 
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where is the difference between the group delay coefficients, and the reduced time is defined T = t-k()Z with Icq being the 
average group delay coefficient. Here, the group-delay dispersion coefficients are assumed equal and A\ and A 2 represent the 
orthogonal polarizations. Note that the vector four-wave mixing terms have been neglected due to unavoidable intrinsic, stress- 
induced, or bend-induced birefringence which leads to accumulating phase difference between the orthogonal polarizations. 
Much more will be said about this fundamental system of equations in later chapters of this thesis. 


1.5.2 Angular Deflection Gate based on Spatial Soliton Dragging 

Spatial dragging gates [58] are one instantiation of a more general class of angular deflection logic gates studied in this thesis. 
The first spatial trapping gate was demonstrated experimentally [ 68 ] using spatial solitons of the same polarization (i.e. phase- 
dependent). but propagating at different angles. Due to cross-focusing, which is the spatial analogy to cross-phase modulation, 
the solitons mutually attract and under the right condition, form a trapped pair. The analogous temporal trapping interaction 
between two temporal solitons of different color (and hence different group velocities), has been studied analytically [64] and 
experimentally [63]. The direct spatial analogy to the Islam temporal trapping [62] and dragging [59] gates is the interaction 
between spatial solitons in the orthogonal eigenpolarization states of a uniaxial crystal [69]. In linear propagation, these 
eigenstates would normally walk-off in space, but nonlinear cross-focusing causes trapping or dragging to occur. A different 
arraneement, which is the one used in this thesis, has also been analyzed theoretically [58,70], in which the interaction occurs 
between tilted orthogonally-polarized spatial solitons in linearly isotropic media. The temporal analog to this interaction is 

temporal solitons of different color and orthogonal polarization. 

The use of solitons or solitary waves in optical logic is critical in that solitons beat the diffraction and/or dispersion limit over 
distances much longer than the characteristic linear lengths. Here, the main interest is in the use of lateral spatial confinement 
over distances larger than the linear diffraction distance. The key idea is illustrated in Figure 1.18, which shows the basic 
angular deflection logic gate. A pump soliton (left-hand side) propagates the length of the gate (which is assumed to be in 
a slab wasceuide geometry) and passes through a spatial aperture at the output, forming the high output state of the device. 
Because the spatial soliton does not diffract, the size of the aperture can be the same as the size of the wave at the.input, 

independent of the actual gate length. • • • t 

The switching operation is performed by disturbing the propagation of the pump beam such it does not exit the spatial 
aperture, thus providing the low output state of the device. This can be accomplished by inducing a change in propagation 
angle which leads to a spatially-resolved shift at the output. If the beam propagates linearly (as shown on the right-hand side), 
a s"patiall>-resolved shift can only be produced by inducing a change in the propagation angle (with a change in phase across 
the aperture of the beam of at least n) that is greater than twice the linear diffraction angle. In nonlinear soliton propagation, an 
induced angle change that is less than linearly resolvable (i.e. less than a 7t phase change across the spatial aperture) results in 
a diflerential spatial shift which can be integrated over non-diffracting propagation such that a spatially-resolvable shift occurs 
at the output Thus, as shown in the figure, the gate length must be at least the minimum resolvable dragging distance, which 
depends on the spatial w idth of the soliton beam and the amount of angular change, much like the case of the deflection of a 
linear pump by a much larger angle and well within the linear diffraction length. 

The angular change is produced through the nonlinear interaction between the pump and another, orthogonally-polarized 
beam, called the signal, which initially overlaps the pump and propagates at a non-zero relative angle. The signal beam must 
be strong enough to induce a nonlinear index change felt by the pump through cross-focusing. If the pump propagates linearly, 
then large-signal gain is not possible because the signal beam is necessarily stronger. In this case, there is no mutual nonlinear 
interaction and the nominal effect of the signal is to create a nonlinear prism which deflects the pump in the direction of the 
signal. The deflection angle depends on the relative propagation angle and intensity of the signal, with the maximum angle 
occurring when the pump is completely guided, or trapped, by the signal. Therefore, the angle 0 must be at least the diffraction 
angle of the pump because the deflected pump will be nonlinearly guided by the signal, which may be a spatial soliton in the 
limiting ca.se. 

If the pump is a spatial soliton. then large-signal gain is possible, but the interaction is more complicated due to mutual 
nonlinear coupling; 


2iko ^ + 2ik, ^ ^ -H 2 A 5 [|4,1^ -b 2A h, 1^] A, - 0 

2 /ko^+ ^ + [lA,p-b 2 AlA,|-]A. = 0 , 


(1.56a) 

(1.56b) 
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Figure 1.18; Logic gate geometry based upon the light-induced deflection of an optical “pump” beam away from a spatial 
aperture at the output. At one extreme, the pump can propagate nonlinearly as a spatial soliton (left), or at the other extreme, 
the pump can propagate linehrly and diffract (right). Deflection is induced by the cross nonlinear interaction with a signal beam 
which is tilted with respect to the pump. The dashed contours represent the deflected pump. 
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Figure 1.19: Transfer function for spatial soliton dragging logic gate demonstrating small-signal gain and saturated levels. The 
pump and sicnal are fundamental solitons of width h’q. The signal soliton amplitude is varied to obtain the increasing input 
power as indicated on the horizontal scale. This transfer curve illustrates a large-signal gain of 2.0, as discussed in section 5.4. 

w here, for consistency with temporal interaction equations, the angle of propagation 0 of the signal Ay (which is analogous 
to group delay I is included explicitly in the paraxial approximation such that 6 = ky/ko- Here, ko is the wavenumber, ns is 
the nonlinear Kerr index, and Ay is the pump envelope. This is the base system of equations used in Chapter 5 for the study 
of the interaction between spatial solitons. Now, during propagation, each beam affects the other through cross-focusing. If 
larce-signal gam is realized, then the pump exerts a greater attractive force on the signal than the signal does on the pump. 
Nevertheless, as mentioned previously, only a small angular deviation of the pump is needed. As in the case with the linear 
pump, two interaction scenarios can occur; each soliton simply deflects the other, or they form a bound, orbiting pair propa¬ 
gating at the weighted-mean angle (58]. The former is typically referred to as dragging [59], while the latter is referred to as 
trapping |65|. Soliton trapping is more likely to occur when the relative propagation angle is small and the solitons are nearly 
the same size, while deflection, or dragging, occurs for large angles and/or large gain. Note also that the optimal interaction 
may not occur in complete trapping, in which case part of the signal, the “shadow” [71], may remain bound to the pump, while 
the rest propagates at a much larger angle as an unbound linear diffractive wave. Therefore, this interaction will be genetically 
referred Id as dragging since the high gain situation is of ultimate interest. 

figure 1.19 shows a typical input-output relation for the spatial dragging logic gate. Note that the same function would be 
obtained for the temporal dragging gate as well. The most important feature to note is the region of small-signal gain at the 
input threshold level near 0.3, surrounded by saturated levels. The transfer function shows that the dragging gate has the same 
operational characteristics as an nMOS inverter [26], as shown in Figure l.l. Here, the role of the electric power supply is 
provided by the pump wave and the role of the gate voltage is played by the signal wave. The presence of small-signal gain at 
the threshold level allows the output to be driven low with very sharp switching characteristics. The threshold level is the point 
on the curve in which the input and output levels are the same. For an input level beyond threshold, the output of the device 
is switched into a valid low state; therefore, the threshold level is the minimum input signal level required to switch the output 
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state of the device, and determines the operating point on the transfer curve where large signal gain is unity. When small signal 
gain is greater than unity at the threshold level, then the input switching level can be chosen such that large signal gain and 
contrast greater than unity can be obtained. The transfer curve is terminated by saturated levels, showing that over switching 
is not possible. As a result, when operating well into the “on” and “off’ regimes, small variations in the input signal level are 
attenuated and do not affect proper gate operation, thus providing large noise margin. A more detailed examination of soliton 
logic gates, based on spatial interactions, is presented in Chapters 5 and 6. 

Because these gates have two inputs and only one output, they are three-terminal devices with input-output isolation. Only 
the undeflected pump is passed on to subsequent gates, and it is important that the pump propagate stably over the length of 
the gate with little change in its physical parameters. This allows for restoration of not only the logic level (because of small- 
signal gain), but also restoration of timing, position, polarization, color, and shape, which is crucial to cascaded operation [25]. 
cLcadlng of these gates can result in more complex logic functionality, as discussed in the next paragraph. Since the pump and 
signal have orthogonal polarizations, the interaction between them is phase insensitive. In the case of linear polarizations, this 
is not strictly true because of the presence of phase-dependent vectorial four-wave mixing terms in the nonlinear polarization. 
Here, it is assumed that these terms can be neglected because of waveguide birefringence, which causes each polarization 
to propagate with a different phase velocity. This assumption is valid when the interaction length is much greater than the 
birefringence beat-length. 

In analogy with temporal dragging logic gate [66] of Figure 1.17 b), logically complete, n-input NOR gates can be imple¬ 
mented with spatial dragging in an n-stage system in which a cascaded pump is dragged to the side and blocked by the presence 
of a sismal in any stage, thereby producing a low output, as shown in Figure 5.35 for a two-input NOR. In this implementation, 
the same pump is passed through two (or more) subsequent stages, so that standardized output levels (in terms of both the 
inverter and the multi-input NOR) may not obtained due to absorption incurred in the additional stages. Complete logic level 
restoration is not strictly obtained either, because broadening of the pump in the presence of absorption also depends on total 
propagation distance, as discussed in section 5.3, such that the shape is not necessarily restored. However, complete logic level 
restoration is not necessary for a small cascade of gates (just as with electronic pass gates) and it is important to note that the 
pump soliton is the one that passes through multiple levels. Absorption will be the ultimate limit to the fan-in (number of 
stages) of this type of NOR gate. Another implementation, in which the total length of a multi-input NOR gate is held constant, 
so That the length of each signal input stage gets smaller with the degree of fan-in, results in both standardized output levels and 
complete restoration, and is discussed in more detail in sections 5.4.3 and 6.3.2. 

The main benefit of the spatial dragging logic gate over the temporal dragging gate is that amplitude keyed logic is straight¬ 
forward to implement while providing gain. In the spatial case, an aperture can be used to discriminate the output. For temporal 
dragsing though, an ultrafast time gating mechanism is required to implement amplitude keyed logic instead of the more natural 
time-shift keyed logic [3). A spectral filter is all that is necessary to implement amplitude keyed logic for temporal trapping, 
but the leverage of the dispersive delay line is lost and gain cannot be provided. Therefore, only the spatial dragging geometry 
has the simultaneous advantages of simple output state determination and the leverage of non-diffracting propagation which 
allows for large-signal gain. Additional advantages are spatial parallelism, as discussed previously, and latency. Typical spatial 
soliton logic gates as studied in this thesis are on the order of cm’s long, while typical temporal soliton gates are on the order 
of lO's of m‘s. This results in a factor of 1000 reduction in latency. Much shorter temporal solitons can be used to reduce the 
length of the fiber-based gates, but higher-order temporal effects can cause problems as discussed in Chapter 6. 

Spatial dragging of purely spatial solitons loses the advantages of gate-level temporal pipelining and low energies (due to 
complete confinement) of the temporal dragging gates, though, and spatial dragging between non-soliton pulses could yield 
partial switching behavior with low contrast, as shown in previous sections. That is why, in Chapter 6, this thesis studies spatial 
dragging of spatio-temporal solitary waves in order to retain the best features of short gate lengths, ultrafast operation and 
temporal pipelining, and low switching energies. The spatial dragging gate utilizing spatio-temporal solitons is perhaps the 
only all-optical switching and logic technology that satisfies all of the requirements for digital switching and logic and has the 
potential for implementation into large scale systems utilizing the high degrees of spatial and temporal parallelism available to 
optics. 

Figure 1.20 shows the generalization of the deflection gate architecture to the use of 2-D Spatio-temporal solitary waves. 
Utilization of fully-confined spatio-temporal waves allows for low energy operation and the additional flexibility to use spatial 
and/or temporal interactions. Here the focus is on spatial dragging, but temporal trapping/dragging may also occur and allow 
for the reduction of the requirements on allowable timing Jitter between the pump and signal. Note that after the first gate, 
timing is restored in the system (because only the undeviated pump clock stream is passed) such that jitter no longer becomes 
a concern. Also, since the solitary wave does not disperse, temporal pipelining can be employed at the gate level, resulting in 
very high throughput which is independent of actual gate length. This architecture maps well onto the processing of a large 
number of independent data streams, such as ultra-fast time-division multiplexed transmission, bit-serial computation [20], and 
bit-serial digital signal processing [18], as discussed briefly in section 1.2. , 
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Figure 1.20: Spatio-temporal solitary wave dragging logic gate. A pump solitary wave (left) will propagate the length of the 
gate and pass through a spatial aperture at the output. In the presence of a tilted, orthogonally-polarized signal (right), mutual 
deflection will cause the pump to shift to the side and not pass through the aperture. The inset schematically shows the input- 
output relation for the gate. The threshold level is set at the point where the input and output are equal. This gate performs an 
inversion operation and can achieve large-signal gain. 
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It is important to make clear that dragging is an asymmetric interaction. The nonlinear waves are brought into temporal 
and/or spatial coincidence in linear propagation, such that they overlap at the beginning of nonlinear propagation. For a 
symmetric interaction, such as collision, the waves are brought into coincidence in purely nonlinear propagation where the 
nonlinear interaction forces are present both before and after the point of overlap. These interaction forces are symmetric in 
that the net change in frequency or angle is zero (the change before complete overlap is exactly compensated by the equal and 
opposite change after overlap), but with a small (not resolvable) residual temporal or spatial shift. Since the dragging interaction 
is asymmetric, the nonlinear interaction forces occur only after the point of overlap such that the frequency or angular change 
is not compensated. This is the key point in understanding the advantages of soliton dragging over other, symmetric, collision 

geometries. 





Chapter 2 

Optical Solitons 


Outside of the communications, switching, and computing applications, optical solitary waves and optical solitons (and solitons 
in ceneral) are interesting in their own right. An entire field of mathematics is devoted to the study of nonlinear integrable 
equations and their soliton solutions [131]. These nonlinear equations are associated with a linear scattering problem and time- 
e%(>lution equation. The bound-state eigenvalues of the linear scattering equation correspond to the soliton solutions and only 
those solutions remain in the asymptotic limit of large time (or long distance). The nonlinear evolution of an arbitrary initial 
condition IS described by decomposing into the linear eigenfunction basis set given by the scattering problem. These eigenvalues 
are then iso-spectrally evolved and transformed back to the original space using the inverse scattering transformation [73,161, 
1()21 This thesis will not make direct use of the inverse scattering transform because in general the multi-dimensional vector 
nonlinear wave equations employed are not known to be integrable [163] and can be solved instead via more simple numerical 
and approximate analytical techniques. 

Numerous examples of these integrable and non-integrable nonlinear wave equations exist in physics, but the examples 
from optics are most closely related to this thesis. There are many mechanisms of optical nonlinearity through which solitons 
or .solitary waves can form, many of which will be discussed in section 2.1. The most ubiquitous example is the optical 
Kerr nonlinearity (section 2.2), which gives rise to the integrable (1+1)-D nonlinear Schrodinger (or NLS) equation, in both 
spatial (section 2.3) and temporal (section 2.4) forms. The multi-dimensional NLS equation (section 2.5) is not known to be 
integrable j 16.3) and describes the propagation of optical waves with additional dimensions of linear and nonlinear behavior. 
With suitable modifications, the NLS equation serves as the basis for describing the soliton behavior necessary for the logic 
<:alCN studied m this thesis. 


2.1 Historical Background 

A solitary wave is a nonlinear wave phenomenon that propagates without change. The most notable recorded discovery of a 
.solitary wave iKcurred in 1834 when J. Scott Russell observed a solitary water wave in the Edinburgh-Glasgowcanal [164,165]. 
Through laboratory experiments. Russell deduced that the wave speed is proportional to its amplitude, i.e. taller waves travel 
faster than shorter waves. This wave phenomenon is termed a gravity wave. He also deduced that an arbitrary initial profile 
will asymptotically evolve into multiple solitary waves, and that two solitary waves, with the taller one overtaking the shorter 
one. will interact and emerge undistorted [ 165]. The.se solitary waves were later called solitons by Zabusky and Kruskal [166]. 
In 1871 and 1876. respectively. Boussinesq and Lord Raleigh showed that, by assuming that the length of the solitary wave is 
much greater the water depth, such a solitary wave has a scch’o amplitude profile. At this point it was realized that the length 
of the solitary wave is inversely proportional to its amplitude, such that a shorter wave is longer than a taller wave. 

The governing nonlinear wave equation for shallow water waves was derived by Korteweg and deVries in 1895 [167], now 
known as the KdV equation; 

du , du d^u ^ 

written in normalized form where u is the normalized amplitude of the wave as a function of position and time. In 1955, 
Fermi. Pa.sta. and Ulam [ 168] studied a numerical model closely related to discrete KdV, and in 1965, Zabusky and Kruskal 
numerically solved the KdV equation with periodic boundary conditions [166] and found that two or more KdV solitary waves 
do not break up upon collision. This particle-like nature led them to coin the term soliton . In this thesis, the term solitary 
wave is used when it is not known that the nonlinear wave is indeed a soliton. 
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The first study of solitary wave phenomena in optics was the self-trapping of one- and two-dimensional filaments [169]. 
This was one study out of a large body of theoretical work that arose from experimental observations of beam filamentation 
due to self-focusing in liquids [170], bulk glasses and crystals [171,172], and atomic gases [173]. Self-focusing was also 
determined responsible for anomalously low stimulated Raman scattering threshold intensities [174]. It was quickly realized in 
numerical studies based on the radial MLS equation [175] that two-dimensional nonlinear propagation could become unstable 
when diffraction does not balance self-focusing, leading to a focusing singularity. In fact, in two and three dimensions, the 
operating point representing the stationary solution where nonlinearity and diffraction balance is unstable, and results in the 
well-known critical collapse or blow-up phenomenon due to form instability [176], referred to as large-scale self-focusing [177] 
in two spatial dimensions. At some point in the process of focusing to a point, the beam breaks up into many filaments giving 
rise to small-scale self-trapping [178,179] and is a result of modulational instability such that small transverse amplitude or 
phase perturbations cause a plane wave (or large scale filament) to decay into individual beams whose subsequent propagation 

dynamics depend on the scale of the perturbation [178]. 

Blow-up is an unphysical manifestation of the multi-dimensional NLS equation and indicates that the original assumptions 
used in the derivation of the equation break down. One mechanism of arresting this behavior is to include saturation of the 
nonlinear index [180,181], resulting in stable self-trapping, thus explaining the stability of small-scale filaments. Another 
numerical study [179] showed the formation of small-scale filaments by following the self-focusing of a beam and subsequent 
breakup into an annular structure with the center portion resembling a small-scale filament. This ring structure was a result 
of the analysis being based on a radial nonlinear wave equation and by symmetry is the only structure possible, but will be 
indicative of an experimental situation in an isotropic medium in which strict radial symmetry of the initial conditions is 
maintained. The full picture of filamentary structure via transverse instability can only be captured with a two-dimensional 
analysis [ 178]. It was also realized that other higher-order nonlinear effects such as Raman and Brillouin scattering arrested 
collapse. The vector nonlinear wave equation derived in Chapter 3 includes non-paraxial terms which should also be included 
in the study of two-dimensional self-focusing, and will always prevent self-focusing to transverse cross-sectional areas smaller 
than order 

In 1972. Zakharov and Shabat [73] published the seminal paper on the inverse scattering transform for the (I-h^-D spatial 
optical NLS equation. A year later. Hasegawa and Tappert [132] derived the (1+1)-D temporal NLS equation for single-mode 
optical fiber and proposed the use of temporal solitons as data carriers for long-haul communications. It wasn’t until 1980, 
thouch. that the first temporal optical soliton was observed experimentally [182]. This experiment was made possible by 
the availability of low-loss optical fiber in the anomalous dispersion regime (i.e. wavelengths longer than about 1.28 ^/m in 
fused silica fiber) and the color-center laser producing ps pulses in that regime. More recent work has focused attention on 
the modifications of the base NLS equation required to adequately describe long distance, short-pulse propagation in fiber. 
These modifications include: higher-order dispersion [30], such as third- and fourth-order; optical shock [183] (also called self- 
steepeninsi or the intensity-dependent group-velocity [184]); and stimulated Raman scattering [185,186] (responsible for the 
soliton seFf-frequency shift [145,187]). Spatial optical solitons have also been realized experimentally in liquid CS2 [188,189], 
gla.ss [118.120] and AlGaAs [190] slab waveguides, and in CS: cells using highly elliptical beams [191] to avoid critical 
collapse inherent in (2+1 )-D nonlinear propagation [175]. Much more theoretical and experimental attention has been paid to 
temporal solitons. though, due to their potential for important applications, such as long-distance fiber communications. 

The previous paragraphs discussed optical solitons formed through the Kerr nonlinearity, but there are other nonlinear 
mechanisms in optics that also allow for solitons or solitary wave behavior. It has been known since the early 1970s that a 
third-order like nonlinearity can be obtained by cascading two second-order nonlinearities, such as occurs with the interaction 
between a fundamental and a field generated through DC rectification [ 192] or a generated second-harmonic [193]. The former 
mechanism has been shown theoretically [ 194,195) to be described by the Davey-Stewartson (or Benny-Roskes) equation that 
exhibits stabilized 2-D spatio-temporal solitons [ 162], but to date, these solitons have not been observed experimentally. The 
latter mechanism has generated a new field of study in nonlinear optics as a means of producing extremely large, ultrafast third- 
order nonlinear effects [196). Second-harmonic cascading been shown theoretically [197,198] and experimentally to support 
bright spatial [ 199] and temporal solitons. 

Other nonlinear mechanisms that support solitons include: formation of temporal solitons in the Stokes pulse via stimulated 
inter-pulse Raman scattering (SRS) [200,201). dark 1-D ]84] and stable 2-D vortex [202] spatial solitons in self-defocusing 
media with thermal nonlinearity, plasma filaments in air [203], and so-called photorefractive solitons [204]. These nonlinear 
mechanisms do not respond on the fs time scales of the Kerr and cascaded nonlinearities and, as a result, may not be as useful 
for ultrafast optical switching. 
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2.2 Preliminaries 

This thesis concentrates on bright one-dimensional spatial solitons and multi-dimensional spatio-temporal solitary waves in 
media exhibiting optical Kerr nonlinearity. The entire class of dark solitons are omitted from study because they have nonzero 
boundary conditions at infinity, implying the requirement of infinite power/energy. This boundary condition can be relaxed 
somewhat by embedding a dark stripe or vortex within a localized bright diffracting background. A multi-dimensional hybrid 
bright/dark solitary wave, or symbion, may also exist for either bright in time and dark in space (which might be important for 
materials with n. < 0 and normal dispersion [205]), or bright in space and dark in time (which might be important for materials 
with 112 > 0 and normal dispersion). 

In one dimension of space, a spatial soliton exists through the balance between linear' diffraction and nonlinear self- 
focusing [73,169). In time, a temporal soliton exists through the balance between linear dispersion and nonlinear self-phase 
modulation [132]. Kerr-type (nonresonant and non-dispersive) refractive nonlinearity is responsible for both self-focusing and 
self-phase modulation and is the fundamental requirement for the so-called nonlinear Schrodinger (NLS) solitons in optics 
which are the subject of this thesis. In one dimension, NLS solitons are stable, meaning that small amplitude or phase pertur¬ 
bations do not upset the balance between the opposing linear and nonlinear effects. 

Under special conditions on the amplitude and beam width/pulse duration, optical solitons can form when the total refractive 

index is well-described by the expression 

« = «o + «2|Ap, (2-2) 

where iiq is the linear index of refraction at some frequency coq and ni is the total ultrafast nonlinear index, or Kerr index, with 
units of cm-/V^ and A is the electric field envelope with units V'/cm^. The definition of the nonlinear index in terms of the 
third-order susceptibility, given by equation 3 . 59 , was chosen such that the above expression for the total refractive index is 
valid. A further generalization includes the quintic contribution to the refractive index through ultrafast , 

/I = Wq -I- W2 1 (2-3) 


where the effective quintic index, with units cm''/V^, is defined as 


= «4 - 


”2 

2 «o’ 


' (2.4) 


ignoring the ca.scaded contribution. The direct quintic index /14 is discussed in section 3.1.3. In this case, the relative dielectric 

constant takes the form ^ 

e =/I'= wq- 1 - 2 / 10 / 12 |A|‘- 1 - 2 /ioM 4 l-^r H—• (2.5) 

It is now seen that the second term in the definition of /i^^ is the result of taking the square-root of the dielectric constant to 
obtain the refractive index. An additional contribution, produced through the process of cascading between the fundamental 
and third-harmonic. is derived in the multiple-scales analysis of section 3.2. 

The quintic nonlinearity will be used in the multi-dimensional solitary wave case as a means for providing stability, but first 
1-D spatial and temporal solitons as described by the cubic NLS equation will be discussed in detail. 


2.3 1-D Spatial Optical Solitons 

Spatial solitons result from the balance between linear diffraction and nonlinear self-focusing as shown in Figure 2.1. In one 
transverse spatial dimension, these solitons are stable such that the balance between the linear and nonlinear effects prevents 
small amplitude or phase perturbations from destroying the soliton. If a perturbation acts to widen the soliton, nonlinear self- 
focusing overpowers diffraction to restore balance. In the opposite case, if a perturbation narrows the soliton, linear diffraction 
overtakes self-focusing. 

The analytical study of spatial solitons begins with the scalar Helmholtz equation which describes the propagation of a 
monochromatic beam in weakly inhomogeneous media 

V-A + k^A = 0 (2.6) 

where £ = iA(.r.,v.c)e“'‘““' -fee is the time-harmonic electric field, k = O)o/i(r,0)o)/c is allowed to have weak spatial variation, 
and the over bar indicates the presence of a fast phase variation due to propagation. Assuming for the moment that k has no 
spatial variation, equation 2.6 can be transformed into the spatial frequency, or Fourier, domain 

A- ky-i- kt^ A = I^A, 


(2.7) 
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diffraction self-focusing soliton 



Figure 2.1: Illustration of the balance between linear diffraction and nonlinear self-focusing in the formation of a spatial optical 
sohton. The 1/e- beam radius is plotted along with the dashed lines which represent the phase fronts. 


where A.,, and are spatial frequency variables representing the Cartesian components of the optical wave vector. The 
solution to this equation is 

y^/t;+A; + lt? = Ao. (2.8) 

which describes the momentum sphere of isotropic propagation and prescribes one spatial frequency component when the other 
two are known. This sets up an initial-value problem for propagation, such that when the electric field is known on a plane 
for example, guaranteeing that two of the three components of momentum are known, propagation to another parallel plane 
at an arbitrary distance is then fully determined by specifying the third component using equation 2.8. If c is the direction of 
propagation, then the initial-value problem has the solution in the Fourier domain 




where,4(/t;,>./.v.O) is known by the Fourier transform of the initial field on the input plane. The field distribution A (x.y.z) at the 
observation plane is calculated by the inverse Fourier transform of A(k,,ky,z). Linear diffraction is described in more detail in 
section 2.3.2 and Chapter 4. 

In media of i.soiropic symmetry class with Kerr-type nonlinear index /12, the inhomogeneous wave number is 


k-(\A\)=k^ + 2l4^\A\- 
«o 


where the quintic nonlinear index is neglected. Substituting into equation 2.6 results in 

V=A+Jt5A + 21t5^lAl‘A = 0. (2.11) 

no 

This equation describes the (2+l)-D spatial propagation of a beam in a weakly inhomogeneous, weakly nonlinear medium. 
Since the 1-D spatial soliton is of present interest, it is assumed that the beam is confined in they dimension by a slab waveguide. 
As a result, assuming that the linearly guided and nonlinear soliton envelopes are separable, i.e. 

A(x,y,s) =‘I>(y)A(jr,z), (2.12) 


where <I>(y) represents the guided profile and is real and of unity magnitude, the nonlinear Helmholtz equation 2.11 reduces to 


h'A d~A ,2 7 „,•> 1 4 12 7 n 
3 -T + 3 "^-h + 2^0 — A — 0 
az- ojr‘ no 


(2.13) 
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and is the starting point for the following analysis of 1-D spatial solitons. 

Note that the effects of waveguide confinement have been formally neglected in equation 2.13. This omission can be 
remedied by using an effective wavenumber and nonlinear coefficient determined by the appropriate modal overlap 
integrals, but the analysis in the next two sections neglects the effects of transverse waveguide confinement for simplicity. The 
fully confined case of 2-D spatio-temporal solitary waves in a slab waveguide is discussed in full in Appendix C. 

2.3.1 The Non-Paraxial Fundamental Soliton 

A soliton is by definition a stationary solution, therefore, using the field amplitude ansatz 

A{.v,z) = (2.14) 

in equation 2.13, the stationary transverse envelope becomes the eigenfunction of the ordinary nonlinear differential equation 


d-Aijc) 


kQ — ^~ + 2kQ-^\A{x)\~ A(x) — 0, 
«0 J 


where P" is the eiaenvalue and P represents the total wavenumber and contains both linear and nonlinear contributions. 

The transverse envelope is assumed to be of the form [169] 

A(jc) = Ao sech > (2-16) 

w here .-td is the real amplitude and wq is a measure of the beam width. Since the soliton is non-diffracting, there is no transverse 
pha.se variation. The stationary solution is given by 

= S'”) 

A:oH’o V '^2 

P^ = ^ + -L = ;-5[i.f^l, (2.18) 

w iih the tiiia! wavenumber |3 % ito -{-k/nzAl/l when the induced nonlinearity is small. 

Since equation 2.13 is invariant under the unitary transformation 

’ _ r COS0 sinG 1 [ 1 (2.19) 

r' I -sinG cosG c 


a more general stationary solution is given by 


=Aq sech 


cosG y — sinG; 


^/p[cos0 c^+sinOV] 


Equation 2.20 is a fully linear and nonlinear non-paraxial solution when the optical field polarization is in the y direction such 
that vectorial effects are absent. Unfortunately, it is mathematically and numerically difficult to propagate this (or other) solu¬ 
tion using the evolution equation 2.13; therefore, a common practice is to reduce this full Helmholtz equation to a parabolic 
one so that robust numerical techniques based on finite-difference or split-step methods, and analytic techniques such as the 
scattering/inverse-scattering transform, can be used. This procedure is carried out formally in Chapter 3 for the fully vec¬ 
torial (3+l)-D and (2+l)-D evolution equations which uncovers the linear and nonlinear physics buried in the second-order 
equation 2.6. but done in an ad-hoc manner in the next section for the (l-i-l)-D case. 

2.3.2 The Paraxial Fundamental Soliton 

Paraxial propagation is described about some mean direction of propagation. For simplicity, choosing z to be this direction, the 
electric field envelope is assumed to be of the form 


A(a,z) -A{x,z)e'^ 
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where ko = is the bulk propagation constant or wave number. The fast phase due to propagation is explicitly removed 

from the transverse envelope so that the slowly-varying amplitude (SVA) and paraxial approximations can be made. Substituting 
this expression into the nonlinear Helmholtz equation 2.13, and making the approximation 


\d^Aldz^\ = 


2ko 


d^A . d^A , 


dzdx^ 


j + 2ko 


«o 


dz 


< \2lcodA/dz \, 


( 2 . 22 ) 


results in the (1+1)-D spatial NLS equation 

2 //!„|l + |^ + 2 * 5 -|ApA = 0 . (2.23) 

dz ax- no 

The first two terms in equation 2.23 simply describe linear diffraction in the paraxial approximation which is seen by taking 
a transverse spatial Fourier transform 

M = = (2.24) 

dz 2ko 

and reduces the z-directed accumulated phase from the maximum value /:o to ^ [l - ^x/ 2 ko] due to the off-axis projection of the 
phase-velocity onto the mean direction of propagation z. This expression only allows for propagation in one direction (along z or 
very near to it) and is the result of the using the slowly-varying amplitude approximation which reduces the Helmholtz equation 
to a uni-directional wave equation. The SVAA is valid when backscattered radiation from linear or nonlinear inhomogeneity 
can be neglected to a given order of approximation, as discussed in Chapter 3. 

Figure 2.2 illustrates diffraction in spatial frequency space. For full (2+l)-D spatial diffraction in isotropic space, the 
surface of allowed k vectors is a sphere. Diffraction in (l-t-l)-D as shown in the figure is represented by slicing the sphere with 
a plane passinu through the origin, resulting in a circle. The paraxial approximation represents this circle as a parabola and 
is valid for small transverse spatial frequencies (small angular bandwidths). Linear propa gation i s described by appropriately 

phasing each transverse spatial frequency component A(kx) with exp(i7:;z), where k- = m the non-paraxial case and 

5 ^ A(, - A 7 / 2 A 0 in the paraxial case. 

Figure 2.3 shows the diffraction of an initial sech(.r/wo) profile beam over 5 confocal distances. Here the confocal distance 
Z) = is twice the Rayleigh range, which represents the distance over which the FWHM intensity pattern increases by 

\^2 K Note that the intensity FWHM is 1.7627vvo [30]. 

Neglecting the diffraction term in equation 2.23 results in the nonlinear equation 

i^A + kjn2\A\-A^0, (2.25) 

dz 

where A ; = cdo A* is the free-space wavenumber. The solution to this equation is written in the real space domain as 

= (2.26) 

with the nonlinear phase accumulation 

^^‘-(■^.z) = kfn2j^\A{x.z')\-dz'. (2.27) 

For small c = Ac. the nonlinear phase is approximated by the expression 

Ac) % A///: |A(jc, Ac/2)|“ Ac (2.28) 

which is second-order accurate in Ac- This nonlinearly induced phase is illustrated in Figure 2.4 for propagation distance Ac 

and peak amplitude chosen such that the maximum nonlinear phase is 4 radians. 

The spatial frequency space is the natural domain in which to solve the linear diffraction problem, while real space is the 
natural domain in which to solve the nonlinear problem (ignoring diffraction). This insight will be used in the development of 
the split-step Fourier method outlined in Chapter 4, which efficiently implements nonlinear propagation by switching between 
the two domains for linear diffraction and nonlinear refraction. 

‘ Unlike the case for Gaussian beams, in the case of a sech(). this distance is not the same as that obtained by interpolating the far-field diffraction pattern, 
which results in a confocal distance 1.60571^ Wq/X 
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Fieure2.2: k-space diagram graphically showing the c-projected wavenumberas a function of the transverse wavenumber 
The paraxial approximation is illustrated by the dashed curve. The transverse spatial frequency spectrum of a beam is shown in 
order to illustrate propagation by associating C with every A', of the spectrum. The paraxial approximation is good for small kx 
but breaks dow n for larger A, where the associated value for k- deviates significantly from the exact result. 
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-initial 



Fiiiure 2.3: Diffraction of a beam of initial sech(.Y/no) profile, and intensity profile sech-(x/wo}, where hq - 14.2 pm and 
intensity RVHM 25 pm. After propagating 5 Zq, where Zn = n-w^/k = 1.85 mm is the confocal distance, the intensity FWHM 

broadens to 140 pm. 



-200 -100 0 100 200 
transverse distance (in um) 


Figure 2.4: The nonlinear phase (dashed curve) induced by an electric field envelope (solid curve). The induced phase is 
proportional to the local intensity and in a material with positive Kerr nonlinearity, tends to reduce the phase velocity of the 
central portion of the beam resulting in a focusing, or lensmg. eflect. 

Now, looking for a paraxial soliton solution, the stationary ansatz 2.14 is substituted into equation 2.23 with the result 

^^ + 2/;„[-p+V:l'iWl’l^W = o- <2.29) 

rf.V- s- J 

Assuming the fundamental soliton form of equation 2.16. the amplitude and phase become 



(2.30) 
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Figure 2.5: Spatial soliton beam of initial sech(.v/vvo) amplitude profile, where wq = 14.'2 pm and intensity FWHM 25 
Fvcn after propagating 5 Zq, where Zq = 1.85 mm, the beam width and position are unchanged. 


|3=1/2^ovv’5 = V^2A5/2, 


(2,31) 


where the amplitude is the same as in the non-paraxial solution and P represents the slow phase nonlinear correction to the 
fast phase linear wavenumber ko. The total wavenumber ko + P is the same as that obtained in the non>paraxial case given by 
equation 2.18 when wq > The nonlinear correction to the wave number P is half that obtained for a plane wave of amplitude 
and simply reflects the fact that the soliton does not have constant amplitude. 

The full paraxial solution is written 


ko^'oy nz V^^’O 


,i[ko^\/2koul]z 


(2.32) 


where the linearly guided profile 0(y) and linear wavenumber ^tre added for completeness. Spatial soliton propagation using 
the split-step numerical scheme of section 4.2.1 is shown in Figure 2.5 with wq = 14.2 pm. After a propagation distance of 
5 Z(). the soliton is unchanged, thereby beating the limits imposed by linear diffraction as shown in Figure 2.3, which would 
result in spatial broadening by a factor of 5.6. 

Defining the optical intensity with units W/cm“ 

,/(-v,y.c)=^|A(.r.v.:)|'=^|/l(.v.y.z)|', (2.33) 

and neglecting the nonlinear contribution to the index and impedance, the intensity of the fundamental soliton is written 

/(.V, V. :) = d>“(y) sech“ ^ , (2.34) 

Ikjw-nz \vvo/ 


for fiz in units of V-/cm-. The power of the soliton is 


(2.35) 


and has units of U'. Note that / sech^(.r/H’o)^/y = 2u*o. 

From these expressions, it is clear that the peak intensity of the soliton increases inversely proportionally to the square of 
the width and the power of the soliton increases inversely proportionally to the width, so that a narrow soliton requires more 
power to launch and propagate than a broader one. 

A further generalization of the paraxial soliton solution is given by 


A(.x,z) =/lo sech 

\ wo J 


(2.36) 
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Substituting this expression into the NLS equation 2.23 results in 


1 

° " kowo V «2 

(2.37) 

0 = 5*,/^ 

(2.38) 

6 - 1 

P Ikow^ 2ko’ 

(2.39) 


which is the paraxial version of equation 2.20. Notice that, like the non-paraxial soliton solution, the NLS soliton solution has 
two free parameters; the width wq and angle of propagation 0. 


2.4 1-D Temporal Optical Solitons 


Temporal optical solitons result from the balance between linear dispersion and nonlinear self-phase modulation (SPM). In 
order to arrive at this one-dimensional system, the temporal soliton is confined transversely in an optical fiber. Even though the 
nonlinearity of silica glass is very small, optical fiber is an ideal system in which to perform experimental studies because of the 
very low linear loss ~ 0.2 dB/km (~ 5 x 10“'' cm“') and long interaction lengths [30]. Because a temporal soliton is inherently 
a one-dimensional nonlinear phenomenon described by the NLS equation, stability is guaranteed as it is in the one-dimensional 
spatial case. 

The scalar wave equation describing temporal propagation in fiber is 


d-A 

dz- 


+ it- 



91 

coo-f/^ 


|A|‘^ = 0, 


(2.40) 


where the electric field £ = iA(x,y,z,t)e~'^ -hcc and Raman scattering has been neglected. The wave number function of 
the operator ^ is interpreted by its Taylor expansion [194], as shown in equation 2.44. Here, parenthesis enclose function 
arcuments. while braces and brackets are used for grouping. The effects of Raman scattering are considered in section 2.4.3. 
Equation 2.40 describes the evolution of the pulse envelope, therefore soliton solutions are termed envelope solitons, in contrast 

to field soliton solutions of other nonlinear wave equations such as KdV. _ 

in the following analysis it is assumed that A has the separable form A = <l)(.r,y)/4(z, t), where is the transversely guided 
profile of unity magnitude The effects of transverse guidance will be neglected for simplicity, as already done in equation 2.40. 
In .Appendix C. the derivation in the case of spatio-temporal propagation in a slab waveguide is presented in full. 

The linear part of equation 2.40 has the simple solution in the temporal frequency domain 

A(A(0,c) = e*'*'"‘o'''^‘“’'/4(A(0,0). (2.41) 


The term Ao) describes some (small) frequency variation about the central frequency (Uq. Similar to the case for linear diffrac¬ 
tion. linear dispersive propagation is fully described by phasing each temporal frequency component A(coo-1-Aco) using the 

appropriate dispersive wave number -I-Aco). 

The variation in wavenumber k with wavelength is shown in Figure 2.6 for bulk silica glass, where waveguide dispersion is 
neglected. The linear and nonlinear optical properties of silica are discussed in more detail in Appendix D. Also shown is the 
wa'velength dependence of the group delay coefficient, w hich is related to the wavenumber by k' = dk/d(o, and can be evaluated 
in two forms 


k‘((n) = ^ 


/i(to) -f- 


tp 3/i(io) 
c du) 



«{X)-X 


a«(X)' 

dX 


(2.42) 

(2.43) 


It is clear from the figure that when X > 1.28 pm, shorter wavelengths are delayed less than longer wavelengths, meaning that 
shorter wavelengths (higher frequencies) travel with a greater group velocity than longer wavelengths. This is the anomalous 
dispersion regime and is the regime in which bright temporal solitons can form with positive Kerr nonlinearity. 

Much effort has been placed on using waveguide dispersion of fiber to tailor the overall dispersion characteristics. One 
example of this is dispersion-shifted fiber, in which the zero value of group-delay dispersion is shifted to longer wavelengths, 
particularly near 1.55 pm where the fiber loss is minimum. Double or quadruple clad fiber has flattened dispersion character¬ 
istics where the ODD is small and nearly uniform over a 25 nm range [30]. Both of these examples are used mainly for linear 
propagation where dispersion is not compensated by SPM. Appendix D briefly covers the effects of waveguide parameters on 
the overall linear dispersion relation. 
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Fiiiure 2.6: Wave number and group delay versus wavelength for bulk silica. The dashed vertical line separates the normal 
(NGDD) and anomalous (AGDD) dispersion regimes. The spectrum of a 16.5 fs intensity FWHM sech() pulse with time- 
bandwidth product 0.316 is also shown. Linear dispersive propagation occurs by phasing each temporal frequency component 
using ihe appropriate wave number. 



2.4.1 The Non-SVEA Fundamental Soliton 

It is not known that equation 2.40 has a general solution. There are solutions for specific material parameter values when the 
full dispersion relation il:-(co) is truncated at a given order. This truncation amounts to making the slowly-varying envelope 
approximation (SVE.M. although the standard SVEA keeps terms only up to second-order in time derivatives and results in the 
temporal NLS equation. 

.'\n analytic solution to equation 2.40 has been obtained [206] keeping linear dispersion terms up to fourth-order in time 
derivatives and all nonlinear terms, excluding Raman scattering. To fourth order, the linear dispersion is 

k~ l^ + ~ikok^-^ - [^' + /:o^o] ^ (2-44) 

With this approximation, equation 2.40 can be rewritten 

p ~ .r = 0. 

Notice that the SVAA has not been made, even though the SVEA has been made to fourth-order in time derivatives. Equa¬ 
tion 2.45 will still be referred to as a non-SVEA equation because it includes two orders of time derivatives beyond the standard 
SVEA equation. 

A solitary wax-e solution to equation 2.45 is given by [206] 


/4(;./) = Ao sech 
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where the expressions for the quantities Aq , v, Tq, P and 5co can be found in the reference. One important property of the solution 
is that all of the parameters are determined by the material constants; there are no free parameters so only one solitary wave 
profile can be a self-consistent solution. This is unlike the NLS case in which there are two free parameters - width/duration 
(or amplitude) and propagation angle/frequency [117]. A condition on the existence of the solution 2.46 is that the material 
dispersion parameters k!^ and are negative, which can be accomplished using quadruple-clad fiber for example [207]. 

Another property of this solution is that, like the non-paraxial spatial soliton solution of the previous section, there is no 
backscatter of radiation from the induced inhomogeneity due to the nonlinear index. This is because the wave is a stationary 
state and does not change with propagation, therefore there is no energy loss due to scattering. The stability properties of this 
non-SVEA solution have not been studied to date, and it should be pointed out that experimental realization may be difficult 
due to the restrictive nature of the solution. 

2.4.2 The SVEA Fundamental Soliton 

Writing the field amplitude in terms of a fast propagation phase and slowly-varying envelope, 

A{z.t)=A{z,t)e‘^, ( 2 - 47 ) 

and substituting into equation 2.40, the (l+l)-dimensional scalar nonlinear wave equation usually considered for soliton prop¬ 
agation in optical fiber [132] is obtained 

+ = (2.4S) 

dc d/ J df" tiQ 

where the slowly-varying amplitude and envelope approximations have been made. The quantity in brackets simply indicates 
that the wave nominally moves at the group velocity Vf. = 1/^0- This quantity can be simplified by making the transformation 
into the reduced time coordinates 

T = i-kkz (2-49) 


and notine that 


+ = + l (2.51) 

5z~S:3T BzBZ ^dT dZ 

With these .substitutions, equation 2.48 can be written 

+ = (2.52) 

dc d/ -no 

which is the temporal nonlinear Schrodinger (NLS) equation. Apart from a scaling factor, this equation is the same as the 
(1 + 1)-D spatial NLS equation. For bright soliton solutions analogous to equation 2.32, it is clear that Aq < meaning that the 

wave propagates m the anomalous dispersion regime. ^ 

By using the transformation 2.49, the coordinate system is changed such that the pulse moving with the group velocity l/k^ 
is stationary on the reduced lime axis T, instead of moving on the time axis / at the group velocity. This results in the leading- 
order frequency-dependent correction to the wavenumber ko being proportional to the group-delay dispersion coefficient Icq, 
or , 

k(io) ^ ko + —A(0‘’k^y (2.53) 

where A:o = mo) and Aco = Q) — 0 ) 0 . 

The linear part of equation 2.52 has a simple solution in the temporal frequency domain 

A(A(o,z) = c'K^“'/-]'A(Aa),0). (2-54) 

Analogous to paraxial linear spatial diffraction, linear dispersion is described by phasing each temporal frequency component 
with the appropriate quadratic phase factor in this case where SVEA was made such that higher-order dispersion is 
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reduced time (in ps) 


Fisure 2.7: Effect of negative group-delay dispersion on a 16.5 fs quasi-monochromatic wavepacket with A/ = 1.55 ^m. The 
initial sech{) packet is shown on the left and the dispersed packet on the right. The electric field oscillation at optical frequencies 
is shown for illustrative purposes. 


1 .O ^ 



Figure :.S Di.spersion of a pulse of initial sechCT/io) amplitude profile, where Tq = 9.36 fs and intensity FWHM duration 
\b.5 fs. .Alter propagating 5 2(), where Zq = 7tT5/2|/:o| = 4.93 mm = ILo/n is the dispersive confocal distance and Lq is the 
standard dispersion length, the pulse FWHM broadens to 91.5 fs. 

nesilceied The dispersion of a wavepacket is illustrated in Figure 2.7 in the time domain. Even though group delay dispersion is 
an en\eU>pe phenomenon, the local phase of the envelope affects the underlying carrier. The figure indicates that in the dispersed 
envelope, higher frequencies move towards the pulse leading edge (negative time) and lower frequencies move towards the 
trailing edge (positive lime), characteristic of the AGDD regime. 

Figurc"2.8 shows the effect of group-delay dispersion on a 16,5 fs FWHM pulse envelope over the dispersive equivalent 
distance of 5 Z,). Note that higher-order dispersive effects must be considered for this wavepack et, but are presently neglected 
for illustrative purposes. The definition of Zq is obtained by using the substitution Tq —>■ ^o\/^Wq\ iti the definition of the 
confocal distance used for spatial diffraction of the scchix/w^) profile. With the consideration of only group-delay dispersion, 
the broadening of the temporal profile depends on the magnitude of GDD, not the sign. The chirp of the carrier does depend on 
the sign of GDD, as shown in Figure 2.7, 

The nonlinear part of equation 2.52 has the solution 

A(T,z) = e‘^'-^^''^A(0) (2.55) 

9 ^'HT,z) = kf,12 \A(T,z')\-dz'. 


with the nonlinear phase accumulation 


(2.56) 
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Ficure 2.9: Effect of positive self-phase modulation on a 16.5 fs quasi-monochromatic wavepacket where the initial sech() 
packet is on the left and the nonlinearly modulated packet on the right, with Tt peak nonlinear phase change. Positive self-phase 
modulation reduces the phase velocity proportionally to the square of the field envelope resulting in an instantaneous frequency 
shift (top right), such that the pulse leading edge is downshifted and the pulse trailing edge is upshifted. 

Because of the nonlinear phase modulation, there is an instantaneous frequency change across the pulse envelope. This fre- 
quenev shitiis defined as ^ 


5co = -* 


-kffi^Az 


d\A{T,Az/2)\- 


dT ’ 

and IS known as self-phase modulation (SPM) [30]. This effect is illustrated in Figure 2.9. The frequencies at the center of the 
pul.se are not shifted because the derivative there is zero. At the inflection points of the envelope the derivative is maximum 
resulting in the largest frequency shift such that the leading edge is downshifted while the trailing edge is upshifted for positive 
SPM. Therefore, in the anomalous dispersion regime, the leading edge will travel with a slower group velocity than the trailing 
edee The net effect is a narrowing, or compression, of the pulse and a temporal soliton results from the dynamical balance 
between positive SPM and linear dispersion in the anomalous regime. This balance is the reason tor the existence of stable 
temporal st'litons. In the normal regime, pulse lengthening is enhanced due to positive SPM. 

It IS clear from Fiaure 2.9 that, even though the envek'pe amplitude is unchanged. SPM alters the pha.se and hence the 
spectral content. This phenomenon is known as SPM-induced spectral broadening and is shown in Figure 2.10 by comparing 
the temporal Fourier transforms of the two wavepackets in Figure 2.9. where significant spectral broadening has occurred. In 
the absence of asymmetric, higher-order effects, spectral broadening is symmetric about the central frequency. 

By analogy w ith the 1-D spatial soliton. the existence of a temporal soliton of the form 


A(r.:) = /fosech(^-je'P- , 

is postulated w here (I is the nonlinear wave number correction. Using this trial solution in equation 2.52 results in 

O'" (2.59) 

P=-A;;.'2t5 = A,/i;A3/2. (2.60) 

In order for the amplitude Ao to be real. Fq < 0. which is the anomalous dispersion regime as anticipated. By comparing the 
expression for the amplitude in the temporal and spatial ca.ses. there is a correspondence between the temporal duration and 
spatial width give by To = “'o, also indicating that k[\ < 0 in order for Tq to be real. 

The full SVEA temporal soliton solution is then 


(2.59) 

(2.60) 


A(.x,y.z.T)=-, 

To 




«I>(.v.y) sech 


i[ka-k"l2xl]z 


where the guided profile d)(.v.>') and linear wave number ko are added for completeness. The numerical propagation of a 
temporal soliton is shown in Figure 2.11 over 5 Zq. As for the spatial soliton, the pulse duration and position in the reduced 
time coordinate frame are unchanged. 
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I'murc 2.10: Self-phase modulation induced spectral broadening. The peak nonlinear phase change is u, resulting in spectral 
broadening by greater than a factor of 2. Subsequent pulse compression utilizing the appropriate net negative group-delay 
dispersion will result in a pulse shorter than the initial pulse. 



Figure 2.11. Temporal soliton pulse of initial sechlT/to) profile, where Tq = 9.36 fs and intensity FWHM duration 16.5 fs. 
After propagating 5 Zo. where Zq = 4.93 mm, the pulse duration and position are unchanged. 


The optical intensity is given by 


eoc/i(.r. v)A.o , ,2 f ^ ^ 

l{x,y,z.T) =- -■ . sech^ I - 1, 

IkfXQiii V^o/ 


(2.62) 


where the spatial variation of the linear refractive index responsible for transverse guidance is retained. The optical power of 
the soliton is 


' ’ Ikfxln. yto 


(2.63) 





Final Report AFOSR F49620-95-1-0431, Spatial Soliton Interactions, S. Blair & K. Wagner, U. Colorado, Boulder 


49 


and the effective transverse spatial area is defined by the expression 


: = — f n{x,y) ^-ix,y) dxdy. 
/m J J 


The total energy of the soliton is then 

^ £oc/2o/:()^Aeff 1-2.6 

” kfXQn2 

with the same scaling as the 1-D spatial soliton. 

As was done for the spatial paraxial soliton solution, the temporal SVEA soliton solution can be generalized to the form 


A(c,7') sech 


T-bl^z\ ;m;_ 


,/[ 3 :- 8 (or]' 


Substituting into the temporal NLS equation 2.52 results in 

(2.67) 

/iQ 1 

S( 0 -/:o ^ 2 . 68 ) 

^ 2 X 5 2 

5A(, = 5o)/t(,'. (2-69) 

This result simpiv means that a change in optical frequency 5(0 changes the linear wave number from ko to fco + 6(0-/.-o/2 and 
the group delay from k'^ to k'^ + boik^. Notice that the amplitude is unchanged which is the result of neglecting the third-order 
dispersion contribution k^ in the NLS equation. 


2.4.3 Higher-Order 1-D Temporal NLS Solitons 


Takimi the temporal nonlinear wave equation one order of approximation beyond SVEA results in the modified NLS, or mNLS, 
equation 

0 1^^ I ' I (2 70) 

-1-2^0^ + 4/Vi; - yj -^ = 0. 

A similar equation'was previously considered [140] but lacked the correct form of the shock term. As shown in Chapter 3. 
this is the order at which the effects of stimulated Raman scattering also appear for the given scalings. Raman scattering is a 
dissipative effect and is considered in the next section on higher-order temporal effects. 

A solution to the mNLS equation is given by 

A{z.T)=Ao sech ^ ^ ^ ^ (2.71) 


with the following parameter values 


6kfn2xl[no/c-k^/2] 

P * ft" I I I 

P = [^0 + oa)k^J j -f — -h - 

okQ = ocoAq + —- ^ 


^-4lno/c-k',/2j 2k^'- 

The only free parameter is the pulse duration Xq, which allows for multiple solitons of different durations/amplitudes to be used 
for switching. Stability has not been investigated theoretically, but numerical simulation verifies that the solution propagates 
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stably. Note that this solution is a generalization of that obtained previously [140] in which temporal walkoff in the reduced 
time coordinate frame was not allowed, resulting in no free parameters. This is explained further below. 

There are two special cases of interest in which SIcq = 0 and 6 (i) = 0. When 5^0 = 0, there is no pulse walkoff in the 
reduced-time coordinate frame. In this case there are no free parameters [ 140] since the pulse duration is now fixed at the value 

t2| = ^ (2.76) 

^o|8i'=o 6 [6(0)1" + ■ 


Another way of expressing this relationship is to determine the value of TOD needed to sustain a given pulse duration. There¬ 
fore, choosing the pulse duration Tq as a “free” parameter, the following conditions hold 




2 to 1 :o - ^J4klxl - 16 [no/c-^[,/ 2 ]' 
4To[no/c-^o/2 


,111 _ 

5t[|=0“ 


65(oAoT5 
35(0-T 5 - I ’ 


(2.77) 

(2.78) 


where the sign was cho.sen in front of the square-root as the proper solution which is verified by numerical simulation. 

The other interesting case is when 5(0 = 0, meaning that the original choice of center frequency (Oq is a solution to the 
mNLS equation. This can only occur when the material dispersion parameters satisfy the condition 


ko 


such that the change in group delay is written 

(2.80) 

Note that T„ remains a free parameter. The temporal walkoff in the reduced time coordinate frame can be eliminated by 
redefining the coordinate transformation as T = / — (ig -I- 5 Aq)c. 

A special case of mNLS, written in normalized form, is known to be integrable [208] 


"•018(0=0 


kotl 


dti I d-u , ,1 

'V+o^ + l"l “ + 'P 


d^u , , ,2 du 
dT 


becau.sc it is a member of the NLS hierarchy and has an associated scattering problem and inverse scattering transform. This 
equation is not of iniere.st here because, like the general mNLS solution with 5(0 = 0, it places restrictions on the material con¬ 
stants which may not be feasible to satisfy. Instead, this thesis deals with more general, non-integrable cases using parameters 
of materials currently in use for nonlinear optical switching. 


2.4.4 Higher-Order Temporal Effects 

The previous section considered additional non-SVEA terms to the NLS equation that still allowed for analytic soliton-like 
solutions. This section considers additional terms that result from the scalar, temporal reduction of the full evolution equa¬ 
tion derived m Chapter 3 for use in spatio-temporal propagation. The additional terms included here that are not present in 
equation 2.70 are fourth-order dispersion (FOD) and stimulated Raman scattering: 


T., 9-4 II d'A i 1 I 

-ifikr , 31/41*"/4 

-|-21fl—|/l|-/l + 4/V8'[-^-yJ 

kf r. 2 F/In kk} 3 1 , , 


, , no 

kf I — 


no I c 2 J 37- 


rR,fix)\A(T-x)\^ 

Jo 


Adx = 0. 


Here, k^' is the fourth-order dispersion coefficient, hk is the nonlinear Kerr index, and Rr{x) is the Raman response function. 
To date, there is no known analytic solution to this equation. As a result, the effects of these higher-order temporal terms on the 
fundamental NLS soliton are now discussed. 
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Ficurc 2.12; Etfects of linear dispersion on a sech() pulse with initial intensity temporal FWHM of 16.5 fs. Based on the 
dispersion properties of fused silica, the group-delay dispersion (GDD) coefficient Iq < 0, the third-order dispersion (TOD) 
coefficient A,'" > 0. and the fourth-order dispersion (FOD) coefficient Ig' < 0. The propagation distance in each figure is 5 Zq. 


Highcr-Ordcr Linear Dispersion 

First, the effects of higher-order linear dispersion will be considered in the absence of nonlinearity. In this case, equation 2.82 
reduces to _ ^ 

The group delay as a function of frequency, as shown in Figure 2.6, can be written (co) = Aq -f A/:'(Aa)), where the approxi¬ 
mation , 

AA^(A(o) ^ A(i)A[[ -f — Aco^Aq^ -h ^Ao^Aq^ , (2.84) 


is made and is cubic in frequency deviation about Wq. 

The effects of linear dispersion on a temporal wavepacket are shown in Figure 2.12. The top plot shows temporal broadening 
due to GDD alone, and is based on the same data used for Figure 2.8. The symmetry of the broadening can be understood by 
considering the change in group delay, which is linear in Ao). Therefore, symmetric deviation about the center frequency results 
in anti-symmetric deviation in group delay about Aq, such that 


A'( 0)0 ± Acu) Aq db AcoAq . 


(2,85) 


As a result, GDD leads to symmetric broadening, with red-shifted frequencies trailing blue-shifted frequencies in the AGDD 
regime. 
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Figure 2.13: Effects of higher-order linear dispersion on a temporal soliton with initial intensity temporal FWHM of 16.5 fs. 
Th'e inidal (dashed curve) and final (solid curve, after 5 Zq) temporal envelopes are shown on the left plot, while the initial 
(dashed) and final (solid) spectral envelopes are shown on the right. The soliton is delayed by 7.25 fs and narrows slightly to 
16.2 fs. 
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The second plot shows temporal broadening due to TOD alone. In this case, the group delay 

/:'(tOo±Au)) ft! (2.86) 

\vhich results in a symmetric change in group delay with frequency. This expression indicates that both red-shifted and blue- 
shitied frequencies will move towards the leading edge (TOD<0) or trailing edge (TOD>0). For positive TOD as shown, the 
peak u ill be delayed in the reduced time coordinate frame with steepening of the trailing edge and oscillations in the tail [30]. 

I he third p\o{ shows the effect of FOD alone. The group delay can be written 

± Aw) ± , (2.87) 

o 

which, like ODD. is anti-symmetric. Again, broadening will be symmetric, but with much stronger change in group delay for 
frequencies lar removed from (Oq. FOD can either enhance (same sign) or oppose (opposite sign) the broadening due to ODD. 
.-\s shown in the plot, the FOD coefficient of fused silica is small enough that its effect is negligible for the choice of initial 
pulse duration and propagation distance. 

Finall). the fourth plot shows the temporal broadening due to the combined effects of all dispersion terms. In fact, the 
full dispersion relation using the Sellmeier coefficients for fused silica (see Appendix D) is used, but the primary effects arise 
from ODD and TOD Because of TOD, broadening is a.symmetric. The signs of ODD «0) and TOD (>0) indicate that TOD 
opposes the change in group delay due to ODD when Ao) > 0, and enhances the change due to ODD when Ad) < 0, as shown 
in Figure 2 6 Therefore, the broadening of the red-shifted trailing edge will be increased while broadening of the blue-shifted 
leading edge will be reduced. Opposite to the case of TOD alone, the peak is now slightly advanced. 

The effect t>f higher-order dispersion on a temporal soliton is shown in Figure 2.13, as described by the evolution equation 


Ti I- 


* i r ^ 
3^oAu ^ 




where ri- = fo instantaneous nonlinear refractive index. Because AGDD is balanced with self- 

pha.se modulation, ilrie soliton is delayed by 7.25 fs in the reduced coordinates. This delay is primarily the result of positive 
TOD. and is similar to the case previously discussed with TOD acting alone on a linear pulse, where the delay was 7.08 fs. The 
peak of the spectrum is also downshifted, as shown on the right-hand plot, which corresponds to the delay experienced in time. 
Except for the delay, in real space, the soliton is mostly unaffected by higher-order dispersion, a testament to (l-i-l)-D stability. 


Optical Shock 

Optical shock |183] is the first-order time derivative of the nonlinearity and gives rise to an intensity-dependent group de¬ 
lay 1184]. The evolution equation describing nonlinear refraction and optical shock is 
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Figure 2.14: Effects of optical shock on a 16.5 fs wavepacket for a propagation distance of 1.5 Zq. The left-hand plot shows 
the initial (dashed) and final (solid) temporal intensity envelopes and indicates slight narrowing to 16.3 fs and delay of the peak 
by 5.68 fs. The right-hand plot shows the initial (dashed) and final (solid) spectral envelopes and indicates spectral broadening 
from 19.1 THz to 33.9 THz due to SPM with asymmetry caused by shock. 


The optical shock expression obtained here has two contributions, one (proportional to phase delay) directly from Maxwell s 
equation in the.time derivative of the nonlinear polarization, and one (proportional to group delay) due to the removal of the 
S\’l: approximation by one order. 
l:\prcssini: the field envelope as 

..\ = |4|c'^ (2.90) 

equation 2.89 can be written as the coupled differential equations 


^ = 6 — 
dz /to 


A,', 


d: 




no 

( 




no 

c 



(2,91a) 

(2.91b) 


The tiTNt cquaiion ha.s the simple solution 

\A(T.:}: = \M:-W^T)\. 

where the intensits-dependent group velocity is defined 

1 


‘ Igv — f » r 

n„ ' . I 


Mr 


(2.92) 


(2.93) 


Therefore, the group velocity is reduced with increasing intensity. 

The self-steepen me effect due to optical shtick is show n in figure 2.14. The peak is delayed more than the wings resulting 
m a steepening of the trailing edge. An estimate of the lime dela\ of the peak is 


AT = 


(I^ 

1 ig\ 



(2.94) 


which gives the value AT = 5.73 fs using the parameters of the simulations. This compares very well with the value of 5.68 fs 
obtained by numerically locating the peak. The .second plot shows that the spectral broadening due to self-phase modulation (as 
shown in Figure 2.10) becomes asymmetric, with slight down-shift of the peak corresponding to delay in the temporal domain. 

The .steepening of the trailing edge results in broad frequency content. Therefore, dispersion must be considered, which 
dissipates steepening. The following evolution equation well describes the effects of shock and dispersion 


3/1 ..„d-A 1 


2A(") — l/fl'/f -|- AikjHic 
"0 


no 

c 


k'o 


dT 


= 0 , 


(2.95) 
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Ficure 2.15: Effects of optical shock and higher-order dispersion on a 16.5 fs wavepacket for a propagation distance of 5 Zq. 
Th'e left-hand plot shows the initial (dashed) and final (solid) temporal intensity envelopes and indicates slight narrowing to 
15.8 fs and overall (centroid) delay of 13.7 fs. The right-hand plot shows the initial (dashed) and final (solid) spectral envelopes 
and indicates asymmetry due to shock and TOD. 


and IS used for the simulation results presented in Figure 2.15, using the fundamental soliton A-sech(7/To) as the initial 
condition. Dispersion dissipates the shock and the balance between AGDD and SPM stabilizes the pulse. As a result, the effect 
of shock IS to delay the entire pulse. The overall delay of 15.8 fs is due to both shock and positive TOD, where the delay without 
shock is 7.25 fs. as shown in Figure 2.13. 


Kumun .Scattering 

This section considers the full evolution equation 2.82. which includes higher-order dispersion, shock and Raman scattering. 
The Raman nonlinearity is de.scribed via the full Raman response function [186] for a single resonance: 


/f«(T) = 


R„c ■'^'-sin(Q/jT) 


(2.96) 


uhere Q/.- = y LI, - is the optical phonon frequency. Qj is the natural oscillation frequency and y is the damping constant. 
Raman scaiterine results in a continuous frequency down-shilt by providing gain at lower frequencies at the expense of higher 
Irequencies. as discussed in Appendix B. This effect is often referred to as the soliton self-frequency shift [ 145). In the AGDD 
regime, the downshifted frequencies travel with a greater group delay (slower group velocity) thereby delaying the wave in the 
reduced time ci'ordinates. as shown in Figure 2.16. 

Spectral narrowing and downshift are clearly seen on the right-hand plot. The peak of the Raman gain in fused silica is 
1.>.2 Til/ lor S2.'( rad/ps). The peak of the downshifted spectrum is about 80 rad/ps below the knee of the high frequency 
side of the curse, indicating that the low frequencies are amplified and the high frequencies are attenuated, resulting in spectral 
narrow tng Spectral four-wave mixing continuously creates new frequencies, called the Stokes (low-frequency) and anti-Stokes 
(high-frcquencs i side bands, allowing for a Raman downshift w hich exceeds the original spectral bandwidth. The manifestation 
of these effects in the temporal domain is temporal broadening (due to spectral narrowing) and delay (due to downshift). Note 
that the delay of 20.0 fs is only slightly greater than the delay of 13.7 fs without the Raman term. This indicates that, in this 
case, the Raman effect is about the same order as the other higher-order temporal effects, but because of the large, continuous 
spectral downshift, the Raman term will manifest itself over much longer distances and eventually dominate. 


2.5 2-D and 3-D Spatio-Temporal Solitary Waves 

The propagation and interaction of multi-dimensional spatio-temporal solitary waves, which are stationary in both space and 
time, is of ultimate interest to this thesis. These multi-dimensional nonlinear phenomena are termed solitary waves instead of 
solitons because, like solitons, they are stationary, but may not satisfy the additional properties of solitons such as inelastic 
collisions (which preserve the soliton eigenvalues) and integrability of the defining equations. Like temporal solitons in fiber, 
these multi-dimensional solitary waves are fully confined, either by complete nonlinear self-confinement in the case of the 3-D 
light bullet or with one dimension of linear confinement by a planar waveguide in the 2-D self-confined case, such that the 
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Fisure 2.16: Effects of Raman scattering, including shock and higher-order dispersion, on a 16.5 fs wavepacket for a propaga¬ 
tion distance of 5 Zq. The left-hand plot shows the initial (dashed) and final (solid) temporal intensity envelopes and indicates 
broadening to 27.4 fs and overall (centroid) delay of 20.0 fs. The right-hand plot shows the initial (dashed) and final (solid) 
spectral envelopes and indicates spectral narrowing from 19.1 THz to 11.7 THz and overall downshift by 4.23 THz. 


energy requirement is low. A brief discussion of 2-D spatio-temporal solitary waves is presented here with more detail on 
propagation and interaction provided in Chapter 6. 

The most straightforward means of describing the propagation of 2-D spatio-temporal solitary waves is the direct e.xtension 
of the (1-t-1 )-D nonlinear Schrodinger (NLS) equation 


dA d'A ,„d~A , i4|2 _ n 

2 ikQ-^-l- 3-T - kA) 571 +2^0 — 1^41 A — 0, 
dz o.v- oT- no 


(2.97) 


but It IS well known that the solitary wave solutions are un.stable to propagation [209). This instability can result in critical 
collap.se. or blow -up. when nonlinearity overtakes diffraction/dispersion.or in broadening when diffraction/dispersion overtakes 
nonlmearitv. In the case of (2-i-l )-D spatial and spatio-temporal in the AGDD regime, and (3-t-l)-D spatio-temporal in the AGDD 
reeime. blow-up is the result [ 176], which indicates that the evolution equation does not take into account additional physics 
w hich arrest this behavior. The inclusion of non-paraxial and non-SVEA terms results instead in spatio-temporal broadening. 

Since the pump must propagate the length of the logic gate undisturbed (i.e. with final size about the same as the initial 
sizet. mechanisms to stabilize propagation need to be investigated. Multi-dimensional propagation can be stabilized by the 
inclusion of a suitable nonlinear index saturation mechanism |2,210] such as ultrafast quintic nonlinearity ot opposite sign to 
the cubic Kerr nonlinearity, i.e. n = iiq + [A]' -h /ij^lAj where A is a scalar field and n^^ < 0 accounts for the retractive 

part of the effective quinii'c nonlinearity. Examples of this type of saturation behavior are the nonresonant nonlinearities of 
.semiconducting .-MGa.-Xs 1143) at sub-half bandgap or the organic single-crystal PTS [125.147). Ultrafa.st saturation from the 
quintic nonlinearitv is the stabilization mechanism discussed in this thesis, but it should be noted that the balancing between 
third-order dispersion iplus space-time focusing) and shock as shown in section 2.4.3 for 1-D temporal solitons may also serve 

as a .stabilization mechanism for multi-dimensional propagation [ 139). 

The scalar (2-^ 1 )-D cubic-quintic NLS equation is 


dA d-A wi; 

+ "/.o 


..eff 

1A|-+-^|A|-' 

/!■> 


A = 0. 


(2.98) 


Alone with the quintic term, any additional terms to the (2-i-l)-D nonlinear evolution equation that have comparable effect 
must also be considered, and the conditions determined under w’hich they can be neglected. This motivates the multiple- 
scales derivation of Chapter 3. in which a full vectorial derivation directly from Maxwell’s equation is performed and results 
in a coupled, non-paraxial. non-slowly-varying envelope (SVE), first-order vectorial differential equation for the propagation 
of orthogonal linear polarizations including nonlinear couplings with the weak longitudinally-projected field. In Chapter 6, 
reduced.'paraxial. versions of these equations are used to study the propagation of a single pump solitary wave and the vectorial 
interaction between pump and signal solitary waves. It is shown that quintic index saturation can stabilize against the effects of 
other, comparable, higher-order terms. 

The scalar equation suitable for the propagation of a single spatio-temporal nonlinear wave under the conditions of present 
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interest is, from section 3.2.3, 


. , dA d-A 




‘ k{) dTdx- 




a|A|-A 

dT 


,,eff 

+ 2A-5^|/l|"/l=0. 


The first term in equation 2.99 is the first-order propagator along the ; direction. The reduced time transformation T = t - k^z, 
where k'^ is the group-delay, is made such that propagation is nominally stationary in the new coordinate system. Paraxial 
diffraction along the .v direction and group-delay dispersion are represented by the next two terms, where kg is the group-delay 
dispersion coefficient and describes pulse lengthening under the SVE approximation. Third-order dispersion, with coefficient 
A"', and space-time focusing [144] follow, and are the first non-SVE corrections. The space-time focusing term describes the 
(paraxial) curvature of the energy front due to spatio-temporal diffraction. The final linear term is fourth-order dispersion. 

The first nonlinear term in equation 2.99 represents third-order nonlinear refraction followed by optical shock. The follow¬ 
ing terms describe the Raman nonlinearity. The effective quintic nonlinear index consists of three distinct contributions: 
directly from the fifth-order polarization, from the product of the third-order polarization with itself due to the reduction from 
Maxwell's equations to a first-order equation, and a “cascaded" contribution due to the nonlinear coupling between the funda¬ 
mental and third-harmonic, which can be tuned via phase-matching [211]. A simple estimate of the size of the cascaded quintic 
nonlinear index suggests that [134] nf « -20;i;, which is negative as desired for multi-dimensional stability. 

The initial conditions used in the simulations of Chapter 6 are the numerically-computed eigenmodes of the normalized 
.scalar i2-r 1 i-D cubic-quintic NLS equation: 


dll dru d-ii , TI , T I |4 
+ ‘i + 2q\u\ u 


( 2 . 100 ) 


where the follow mg definitions are made: ii = — nx' + 4^/0 ’S the total instantaneous non¬ 
linear retractoe index. : = c/ 2 A(,vc 5 . .v = x/no, t = s = -sign (k'^), and q = /iQ/if/«5^o"'r, with uo a measure of 

the trans\erse spatial width of the solitary-wave. For a bright spatio-temporal solitary wave with /i; > 0, operation must be in a 
region ol anomalous group-delay dispersion (AGDD) such that 5 = •+ 1 . The symmetry of this cubic-quintic equation allows it to 
be transformed into an ordinary differential equation [2] and subsequently solved via standard fourth-order Runge-Kutta [212]. 

The exigence of a radially-symmetric stationary solution to equation 2.100 of the form 


ti{y.T,z) = t/(p)exp(/Pc) 


( 2 . 101 ) 


Is postulated. v\herc - \ y* 
cquatu^n 


T-. Substituting this ansatz into equation 2.100 results in the ordinary nonlinear differential 


-f + [21 r- + 2qU^ - P] fy = 0. 
dp- p dp '■ 


( 2 . 102 ) 


which has fundamental and higher-order solutions corresponding to increasing optical energies, physical size, and number of 
zero crossings of the field (21. 

For the purpo.ses of optical switching, only the fundamental eigenmode solutions to equation 2.102 are of interest, which 
pos.sess the minimum size and energy [2]. It should also be noted that the higher-order modes are unstable to angular per¬ 
turbations even with a saturating nonlinearity [213]. There is a family of such fundamental solutions of different widths and 
amplitudes parameterized by the value of z/. Using the relationship between the field amplitude/! and the normalized amplitude 
II. the q parameter can be rewritten as 


l^o|- 


(2.103) 


where |/!o| and Uu are real and represent the peak value of the field amplitude in real and normalized units respectively. Now 
the quintic nonlinear index /i^*^ is written in the convenient form 
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where Asat is a real constant saturation amplitude. With this definition, the total refractive index seen by a scalar field is 


n — HQ +112 




(2.105) 


such that, for a peak field amplitude jAol = \/oAsat, the induced nonlinear index will be reduced from the purely Kerr value of 
/n to [1 - ct]/i->, where a = |Ao|" /Aj^t is the saturation parameter. The definition 2.104 is used so that the form of equation 2.105 
is consistent with the first two terms of the Taylor expansion of the two-level system saturation [214]. 

Including only the cubic and quintic contributions as shown in equation 2.105 cannot result in true saturation. When > 0 
and /if < 0*as in the situation here, the induced nonlinearity reaches a peak at [Aj' = A-Jl, then decreases and eventually 
becomes negative when |A| - > This behavior is an indication that higher-order terms in the material polarization expansion 
may need to be included, but one experimental measurement of the induced nonlinear index in PTS at X/ = 1.064 pm shows 
that this roll-over into a negative contribution does indeed occur [147]. Nevertheless, when a < 0.5, equation 2.105 mimics 

true saturation behavior. n''m'' — 

Combining equations 2.103 and 2.104 and substituting into the eigenvalue equation 2.102 and noting that - 

|A(j!" /A"j,, results in the following ordinary differential eigenvalue equation 


d-U 1 dU 

, T -1- ] f - 

dp- p dp 




- pt; = 0. 


(2.106) 


I’leurc 2.17 illustrates a few fundamental eigenmode solutions of equation 2.106 calculated as a function of peak normalized 
arnplitude ( o using the normalized saturation amplitude (/sai = \/l6 0. The heavy solid line (Uq = 1) corresponds to the 



normalized radius p 


Figure 2.17; Fundamental eigenmodes of equation 2.106 plotted as a function of peak normalized amplitude. Here, the normal¬ 
ized saturation amplitude t/sa, = 4, and o = 0.0625, 0.125.0.25, 0.5. and 0.6 for f/o = 1.0, 1.41,2.0,2.83, and 3.1, respectively. 
The inset plots show energy ratio (top) and normalized FWHM (bottom) as a function of Uq, with asterisks denoting the 
positions of the eigenmodes in the main figure. 

saturation parameter o = 0.0625, while the heavy dashed line ((/q = 2.83) corresponds to o = 0.5. The top inset shows the 
energy ratio as a function of peak amplitude when the o = 0.0625 eigenmode is used as the signal spatio-temporal wave. When 
using o = 0.5 for the pump, the energy ratio is 1.712. The energy ratio increases with increasing o, but when a > 0.5, the 
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induced index as given by equation 2.105 begins to decrease from the maximum when a = 0.5. The lower inset shows the 
variation in intensity full-width at half-maximum (FWHM). The FWHM initially decreases with o, reaches a minimum near 

a = 0.5, then increases as 0-4 I. . ^ • j / 

Figure 2.18 plots the o = 0.5 fundamental eigenmode (dashed curve) of equation 2.106 along with a = 0 eigenmodes (no 
saturation, solid curves) of peak normalized amplitudes of Uq = 2.83 and Uq = 1.94. The non-saturated Uq = 1-94 eigenmode 



normalized radius p 


Ficurc 2 In Numcricallv computed eigenmodes of equation 2.106 tor the saturation parameter a — 0.5 and normalized peak 
amplitude ( = 2 8.' (da.shed curves) and for the saturation parameter o = 0 with Ug = 2.83 and Uq = 1 94 (solid curves). The 

elleei ot ^.lluratlon is to broaden the width of the eigenmode near the peak where the saturation has its greatest effect. The 


sp.iee. time bandwidth products arc 0.312 for the saturated eigenmode and 0.242 for the non-saturated eigenmodes. 


is .1 scaled version ot the f,, = 2.83 non-saturated eigenmode with the same normalized half-width at half-maximum (HWHM) 
ot (I .'s 1 .^.'v as the saturated O = 0.3 eigenmode, and will be used in Chapter 6 to compare the eltccts ot the higher-order temporal 
terms on the pump with and without saturation in section 6.1. Note that lor the same HWHM. the saturated eigenmode will 
have a larger value ot (than the non-saturated one. The normalized FWHM of the intensity profiles of these eigenmodes is 
1 ()2~. while the I'Wll.M of the a = 0.0625 signal cigcnmode is 2.063. 

.Stabiliiv ot the cubic-quintic eigenmodes can be examined by the evaluation of the stability parameter [209] 



(2.107) 


where F is the integration over the eigenmode profile and repre.scnts power in the 2-D spatial case and energy in the 3-D 
spatio-temporal case Stability is ensured when 5 > 0, which is true for (1 + 1)-D NLS propagation, otherwise, propagation 
Is unstable For (2-I-1 )-D propagation, 5=0. and tor (3-sl i-D, 5 < 0 [2]. The first condition is the result of the fact that the 
power of the 2-D eigenmode is constant with rc.spect to width and eigenvalue [169], such that an increase in width, which 
reduces dilfraction/dispersion, is exactly compensated by nonlinearity, and a decrease in width, which enhances nonlinearity, 
IS exactly compensated bv diffraction/dispersion. The second condition results from the fact that the energy of the 3-D eigen¬ 
mode decreases w ith decreasing width and increasing eigenvalue [2], which is opposite to the 1-D case, such that an increase 
in width results in diffraction/dispersion overtaking nonlinearity, and a decrease in width results in nonlinearity overtaking 
' diffraction/dispersion. 

Figure 2.19 plots the eigenmode power (top) and stability parameter (bottom) versus eigenvalue for the (2+l)-D cubic- 
quintic NLS equation 2.106. It is shown that, like the 1-D case, the power increases with increasing eigenvalue (decreasing 
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nonlinear eigenvalue jS 

Fisiurc 2.19: Pint of power and stability versus nonlinear eigenvalue, indicating that quintic saturation results in stable (2+l)-D 
cigenmtide>.. The asterisks denote the positions of the fundamental eigenmodes of Figure 2.17. 

width), indicating stability.and S is always positive. The figure al.so indicates that eigenmodes with greater [3 (i.e. greater satu¬ 
ration) are more stable than eigenmodes with small p. The stability parameter asymptotically approaches zero with decreasing 
P which IS expected since o —v 0 as well. 

Stable propaeatic^n of the o = 0.5 cubic-quintic eigenmode is verified by numerical simulation of equation 2.98, as shown 
in Ficure 2.20 The width parameter is chosen as uq = 39.6 ;im. which results in intensity spatial FVVHM ot 40.7 pm, and 
temporal RVilM of 16.5 fs. The propagation distance is 15 Zo. over which the spatio-temporal solitary wave is unchanged^. 
The same stabilized solitary-wave behavior occurs in the fully 3-D case as well [2]. The confocal distance Zq = 0.559AoWq 
= 5.14 mm. and is calculated numerically. For the simulation, the parameters of fused silica (see Appendix D) are used at 
A; 1.55 pm The first section of Chapter 6 studies the effects of the higher-order terms of equation 2.99 on stability of this 
o = 0.5 cubic-quintic pump solitary wave eigenmode, and show s that stabilization does occur, albeit with some spatio-temporal 
broadening due to Raman scattering. 
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transverse distance (in /xm) 


Figure 2.20: Stable propagation of the a = 0.5 eigenmode of the cubic-quintic NLS equation without the effects of the higher- 
order terms. The spatio-temporal wave of transverse FWHM 40.7 /im and temporal FWHM 16.5 fs remains unchanged after 
propagating 15 confocal distances. The contours are at -3 dB intervals relative to the peak intensity in each frame. 






Chapter 3 


The Vector Nonlinear Wave Equations 


Optical nonlinear interactions generally fall into three regimes of operation: parametric, dynamic and transient coherent. Op¬ 
erating very far from resonance, where the material simply serves as an intermediary for the interaction between coherent light 
beams'or pulses, is known as the parametric regime. In this regime, the nonlinear response is purely reactive (real nonlinear 
susceptibilities) because material transitions are virtual, meaning that there is no exchange of energy between light and matter 
because the nonlinear response is a result of wavefunction deformation and is essentially instantaneous and lossless. As a result, 
energy is conserved among the beams or pulses of light and the Manley-Rowe relations hold [215]. The particular parametric 
case of the third-order nonlinear material polarization responsible for self-focusing and self-phase modulation is known as the 
optical Kerr effect, and is of fundamental importance for the optical nonlinear Schrodinger-type (NLS-type) equations derived 

in the chapter. . • j-r 

The dynamic and transient coherent regimes are resonant and differentiated by the ratio between the pulse duration and T 2 , 

the transverse relaxation (or transverse dephasing) time of the resonant excited state. The transverse relaxation time is related 
to the homogeneous linewidth by 73 = l^T/,. The homogeneous linewidth typically lies in the range GHz-THz for optical 
materials at room temperature, but in solid persistent spectral hole burning materials at low temperature. T/, can be as narrow 
as 100 Hz [216]. Indeed, much effort is directed at finding materials with slow dephasing at low temperature (along with broad 
inhomoueneous bandwidths) for applications in time-domain holography [217,218] and frequency-domain data storage [219]. 
Note that the inhomogeneous dephasing time is defined T: = I/tcT,. where T; is the inhomogeneous linewidth. The longitudinal 
relaxation time T\ of the excited state leads to the “sluggish" nature of nonlinearity and corresponds to the decay time-constant 
of ihe induced population change of the excited State. 

The dynamic regime of optical nonlinearity occurs near resonance when the pulse duration is greater than 73 . Operating 
in the dvnamic regime ensures that light-induced coherence is lost within the pulse duration and is obviously satisfied by 
continuous-wave irradiation. This nonlinearity produces real excitations because energy is exchanged between light and matter, 
and due to resonant enhancement, nonlinearity can be very large, but at the expense of large absorption. The transient-coherent 
restime is when the optical excitation is on a shorter time scale than Tz', therefore light-induced coherence persists longer than 
the excitation. Intere.sting effects arise in this regime such as [214] self-induced transparency (SIT), photon echos, and free- 
induction decay, and may allow for large nonlinearity with small absorption (but long response time) for specially designed 
pulses. 

The parametric and dynamic regimes of operation are studied in this thesis for optical soliton switching and computing. 
The tradeoff between the parametric and dynamic regimes is typically between low absorption and large (resonantly-enhanced) 
nonlinearity. The simulations of soliton interactions in Chapter 5 include absorption, and material figures-of-inerit are derived 
in order to determine the maximum gain that can be obtained by the logic gate due to the presence of absorption in candidate 
materials. These figures-of-merit show that the tradeoff must be made in favor of small nonlinearity and low absorption, 
restricting attention mainly to the parametric regime of operation. 

This chapter derives the fundamental nonlinear wave equation, which is a multi-dimensional and higher-order extension to 
the nonlinear Schrodinger (NLS) equation, appropriate for describing propagation in media of isotropic symmetry class and, 
with minor modifications, the cubic 432,43m and m3m symmetry classes. These choices are motivated by the presently avail¬ 
able materials fused silica (isotropic) and AlGaAs (cubic 43m), each of which satisfy the material figures-of-merit. Section 3.1 
starts with Maxwell's equations and the material polarization expansion and derives the vector nonlinear Helmholtz equation. 
In the parametric and dynamic regimes, the nonlinear polarization can be represented by a Taylor series expansion [220], and is 
the approach taken here. The general integral forms of the first, third, and fifth terms of the polarization expansion are manipu¬ 
lated into differential form thereby tran.sforming the integral-differential nonlinear Helmholtz equation into a purely differential 
one. The resulting second-order differential equation is difficult to handle numerically in multiple dimensions when long-time 
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evolution is desired, as it is here. As a result, a first-order equation is derived in section 3.2, with the two leading-order scales 
of the Helmholtz equation removed, which can readily describe long-time evolution. This section uses the asymptotic multip e- 
scales technique to derive the uni-directional, fully vectorial, nonlinear wave equation directly from Maxwell’s equations for 
(3+l)-D propagation in weakly nonlinear media. This equation is reduced to forms suitable to describe the (1+1)-D spatial 
and (2+l)-D spatio-temporal simulations presented in thesis which are the starting points for numerical analysis based on t e 
split-step method discussed in Chapter 4. 


3.1 Derivation of the Vector Nonlinear Helmholtz Equation 

In this section, the vector nonlinear Helmholtz equation valid for refractive and absorptive effects first-, third-, and fifth-order in 
the electric field is derived. The forms of these polarizations are then specified and examined separately in detail. The complete 
causal dispersive character of the linear susceptibility is retained in a compact time-domain formulation (Section 3.1.1) valid 
for less than unity fractional temporal bandwidths, while the third-order response is reduced from full nonlinear dispersion 
to the dominant time-domain forms describing the optical Kerr effect and fully dispersive stimulated Raman scattering (SRS) 
(Section 3.1.2), and the fifth-order response is reduced to instantaneous form (Section 3.1.3) where dispersion is assumed weak 

enough to be neglected. ... .... ... 

The starting point for deriving the vector nonlinear wave equations is Maxwell’s equations, written in difterential form. 

VxE = (3-1) 

dt 

dD /I -w 


VxH = 




where E is the electric field. H is the magnetic field, D is the electric displacement, B is the magnetic induction, J/ is the free 
current (or source current) and p/ is the density of free charge. Boldfaced variables represent vector quantities. In the dielectric 
media appropriate for studies in optics, J/ = py = 0. In order to couple the material response with the electromagnetic radiation. 

Maxwell's equations mu.st be amended with the constitutive relations 

B = AtoH (3.5) 

D = eoeE=£oE-l-P (3-6) 

where f is the relative material dielectric tensor, £<> is the free-space dielectric constant, P is the material polarization andpo is 
the permeabilitv of free-space. Note that the material is assumed to be non-magnetizable, i.e. p = po- 
Using the Maxwell's equations 3.1 and 3.2 and the constitutive relation 3.5 

d - 7 \ 

Vx[VxE]=-^(VxB) = -po-^. (3-7) 


which can be rewritten using the constitutive relation 3.6 


V X [V X E] = - - 


1 8-E 
c- dr- 


d-P 


where c = \/ i /puEi, is the velocity of light in free-space. 

The material polarization P consists of linear and nonlinear contributions; 

p _ pL pNI- _ pL ^ p(-) ^ p(3) _!-^ (3.9) 

where P '- is the second-order polarization, etc, and the expansion of the material polarization in a Taylor’s series converges 
when the material response is sufficiently nonresonant [220], such as in the parametric or dynamic regimes. 

Since this thesis is concerned with multi-dimensional and higher-order generalizations of the (NLS) equation, it is assumed 
that the lowest-order nonlinear contribution is The derivations are directly valid for the isotropic symmetry class, but are 
also valid for cubic 432. 43m and m3m with minor modifications. In centrosymmetric media, such as media of isotropic or 
cubic m3 and m3m symmetry classes, = 0. where ( is a positive integer. Cubic classes 432,43m and 23, have non-zero 
For the latter three cubic classes, second-harmonic generation can usually be neglected because the cubic classes are 
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linearly isotropic and phase matching is difficult to achieve. In addition, resonant generation of radiation around DC due to 
optical rectification can also be ignored because the phase velocity near DC is typically much smaller than the group velocity 
of the optical driving field, although important effects can occur even in the nonresonant case [195], but are not considered in 

^'^'^Separating the linear and nonlinear parts of the material polarization, the fully vectorial, nonlinearly-dnven wave equation 


IS 


1 3-E 


,2ijL 


V X [V X E] + ^ -tpo 


d-p' 


dr- 


= -pQ- 


92 pNL 

dr ’ 


or, in the alternate form 


, , , 1 a-E a^pNL 

V-E-V[V E]-^-p--Aio 3^2 -^0 9,2 ' 


(3.10) 


(3.11) 


where the vector identity V x [V x E] = V [V • E] - V-E is used. These equations are generally valid for nonlinear optics. The 
forms of the polarizations are now specified. 


The Material Polarization 

The most general form for the linear material polarization, for example, including the full spatial and temporal response, is [221 ] 

P^{r,t) = EoJ I(r',T)-E(r-r',r-T)^fr'<fT, (3.12) 

which describes nonlocal behavior in space, such as diffusion of real carriers, and causal nonlocal behavior in time. By 
emplov.ng the electric dipole approximation [221], and thereby ignoring optical activity as well, the spatial response will not 
be considered in this thesis. Therefore, the material polarization convolution integrals describe only spatially localized material 
respon.se but still retain non-zero time response. The form taken for the linear polarization is then 

P^(r,/)=:Eo^ S(r,T) •E(r,r-T)^fT, (3.13) 

where is a contraction operator and the impulse-response function is allowed to vary spatially as would Occur for layered 

media The linear impulse-response function ^ is a second-rank tensor and causality requires that the response is zero for x < 0 
(for T < 0. the material polarization would depend on values of the field E(r - x) that haven't yet arrived), justifying the lower 
bound on the integral. A second condition is that the respon.se function must be real because both the driving field and material 
polarization are real. Another common way of writing the linear polarization is 

p)-(r,/) =£()^ /?jr(r.x)£,.(r,r-x)rfx (3.14) 

where j.k £ [.v. v.c) and the Einstein summation convention over repeated indices is used. 

Similarly, the form for the third-order nonlinear material polarization is 

= Eo / f f P(r.Xi.x.;.Xj):E(r,/-Xi)E(r,/-X 2 ) X (3.15) 

Jo Jo Jo 

E(r,f-Xi)r/X|r/X2r/X3, 


where the third-order impulse response function /? is a fourth-rank tensor,: is a contraction operator and the induced material 
polarization depends on the product of three fields. Here, spatial dispersion can additionally arise from the diffusion or prop¬ 
agation of real particles, such as electrons or phonons (i.e. heat), which are created by the nonlinear light-matter interaction. 
These effects are neglected by assuming sufficient distance from resonance. As before, the third-order nonlinear polarization is 
written in component form using the triple Einstein summation convention over repeated indices 

pf\r,t) = toJ^ ^ /?j*/m(r.t,,X2,X3)£*(r,/-Xi) X (3.16) 

Ei(r,t -X 2 )E,„(r,t-X3)dx\dX2dX3. 


The third-order nonlinear polarization is responsible for four-wave mixing effects such as nonlinear refraction (self- and cross- 
focusing and self- and cross-phase modulation). Raman and Brillouin scattering, and third-harmonic generation. 
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For ultrashort pulses with large temporal bandwidths, such that the excitation duration is much shorter than the 
response time Ti, the time-domain formulation of the nonlinear material response is the appropriate representation [150,220] 
because the electric field can be considered a delta function and the convolution integral trivially evaluated. In the opposite limit 
where the optical excitation lasts much longer than the material response time, the frequency-domain susceptibility formulation 
is most commonly used, such as the case for the mixing of monochromatic beams, because the material response function takes 
on a constant value. This thesis deals with an intermediate nonresonant situation for which part of the nonlinear response can 
be considered instantaneous and hence represented by a constant in the frequency-domain, while part of the response is non- 
instantaneous such that the material polarization is nontrivial in either formulation. With these arguments in mind then, mstea 
of usino the more general form for the third-order nonlinear impulse response function in equation 3.16 (which is valid for 
nonresonant as well as near-resonant processes [220]). the following form can be used that is appropriate for the nonresonant 
processes just discussed [145,222] 

/^j^•;m(^,T|,T^T3) = /^;;^.,„,(^)5(T,)5(T2)5(t3) (3-17) 

+ -T2)5(t3) 

where the constant Rft,Jr) accounts for the strength of the instantaneous electronic, or Kerr, response and the Raman response 
funciion | l.'iOl (r,T| , 12 ) accounts for the slower nuclear response. This form does not describe general nonlinear disper¬ 
sion. but doe.s describe the most important effects to this thesis - nonlinear refraction (in both instantaneous and delayed forms) 
and ihird-harmonic generation and down-conversion. 

Raman scatteriniz is a nonlinear refractive/absorptive effect that arises from the rotational or vibrational response of a 
nucleus to ihe time-varying electric field. In the quantum picture, Raman scattering is the interaction between photons and 
optical phonons (i.e. high-frequency phonons with non-zero dipole moments). This effect is described by a time-dependent 
nonlinear refractive index [222] as evident by using the impulse response function of equation 3.17 in the convolution integral 
ot equation 3.16 

P'j^Hr,t) = eoR%,Jr}EUr.r)Ei(r,t)E„,{r,t) (3-18) 

-feo^ R%Jr.x)Ei(r,t-r)E,{r,t-x}E,„(r,t)dx, 

where T - T. = T:. The Raman response is studied further in Appendix B. 

The corresponding fifth-order polarization is 

P^/\r.t) = toRju„^Jr}E^ir.r]E,(r.t)E,„irj)E„irj}E„(rj}. (3.19) 

and is treated only m instantaneous form responsible for nonlinear refraction. 

Since the terms in the material polarizations depend on time history, it is useful to define the temporal Fourier-transform 

pair 


E(r.r) = E(r,o))e' 

E(r.O)) = j E(r,/)c'"Vr 




(3.20a) 

(3.20b) 


in order to examine the linear and nonlinear polarizations in the temporal frequency domain. For compactness of notation, 
all integrals without limits are taken to extend from —» to -foo. Using the Fourier transform relations and performing Tay¬ 
lor's expansions in the temporal frequency domain, the convolution integrals of equations 3.14 and 3.18 can be rewritten in 
differential forms for pulses of finite temporal bandwidths. as shown in sections 3.1.1 and 3.1.2, respectively. The reason for 
writing the material response in differential form is that the integral-differential wave equations 3.10 and 3.11 become purely 
dilTerential. Before this is done, though, the quasi-monochromatic representation, which is key to writing the material response 

as differential operators on the pulse envelope, is examined. 

The Quasi-Monochromalic Representation 

In this thesis, the quasi-monochromatic representation of the optical field is used such that in the time-domain 


E(r.r) = - |A(r,r)e"'‘''o'-l-A (r,f)e'“o' 


(3.21) 
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Figure 3.1: Quasi-monochromatic wavepacket of fractional bandwidth T] — 012. The left figure shows the representation in 
lime and the right figure shows the representation in frequency. The dotted lines in the time-domain plot illustrate the pulse 

envelope. 


The qua.si-monochromatic representation assumes that the field has some finite bandwidth centered about the carrier frequency 
ii\). meaning that the fast temporal phase variation is removed leaving only slower temporal variation in the envelope. In fact, 
the notion of an optical envelope has been shown to be valid down to the single-cycle regime [223], but the Taylor expansion 
of the Imear and nonlinear convolution integrals performed in the subsequent sections of this chapter does not converge in this 
remme where the fractional temporal bandwidth approaches or even exceeds unity. For generality, the envelope still retains a 
fast propagation phase variation, which is denoted by the over-bar. The quasi-monochromatic representation is illustrated more 
clearly upon transformation into the temporal frequency domain 

E{r,(D) =j^A(r,co) ♦5(t!)-o)())-h A (r.co) ♦ 5(a)-ha)o)j 

= ^ |A(r.o}-‘"f* A (r,co4-cOo)j (3.22) 

where • is the convolution operator. In the following sections, these expressions will be used to examine the material response 
in the temporal-frequency domain and define the susceptibility tensors. 

The time and frequency domain representations of a quasi-monochromatic wavepacket are shown in Figure 3.1. The frac¬ 
tional bandwidth is defined as the full-wadth at half-maximum frequency bandwidth AWfwhm divided by the center frequency 

A 

^ (3.23) 

C0() 

where the quasi-monochromatic representation is valid to the point where q -- i. In the multiple-scales analysis of section 3.2, q 
will serve as a small expansion parameter; therefore, the situatit)n of interest is when q < 1. Note that when the third-harmonic 
is considered, an additional slowly-varying envelope about 3it)t) must be defined, otherwise the tractional bandwidth of the 
enveU^pe A about the fundamental would exceed unity 


3.1.1 The Linear Material Polarization 

This section transforms the convolution integral of equation 3.14 into the temporal frequency domain representation P ((i)). The 
linear susceptibility is defined and Taylor-expanded such that the inverse Fourier transformation into the time domain results 
in a much simpler differential representation of 3.14. The derivation is presented in detail here in order to shorten the similar 
procedure used for the third-order polarization in the following section. 

Using the time/frequency Fourier transform relation, the linear material polarization given by equation 3.14 can be rewritten 

as 


pL(r,/) = Rjk(r.T) j Ek{r,(a)e ^l^O) dx 

= [/0(t)/?;A.(r,T)e'“«^T £*(r,co)e-''“Va) 
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= Jx>(r,o))£/:(r,(D)e"'‘“^/co 

where the order of integration is changed in the second step, 0 (t) is the unit step (Heaviside) function required by causality 
and normally incorporated into the definition of R, and the linear susceptibility is defined 


(3.24) 


Xy*(r,a)) = 




(3.25) 


which guarantees that Xjk{-a)) = XjkM since Rjk is real. An interesting manifestation of causality in equation 3.25 is the 
Kramers-Kronig relations. Equation 3.25 can be rewritten [224] 


, ^ r8(“) '■ 

xjk{<^) = + ^ 


" n J Q- 


0) 


dD., 


(3.26) 


where T represents the principal value. The linear susceptibility consists of real and imaginary parts Xjk - ^e{Xjk} + 
i Bn {Xjk } so that equation 3.26 is rewritten 




Q-co 

which arc the linear Kramers-Kronig relations. 

The linear material polarization is now written completely in the temporal frequency domain. 


(3.27) 

(3.28) 


^(r,co) = Xjk(r.t>y\)£k(r,(Jii)e~‘'^''duii 

= £*(r,(0,)r/(0i 

= Eo <^1)5(01-0)1 )rf(Oi 


(3.29) 


which indicates that the linear polarization is simply the product of the linear dispersive susceptibility with the amplitude of the 
temporal frequency spectrum of the field. 

Substituting the wavcpacket representation into equation 3.29 results in 


^(r.w) = A*(r,a)-a)o)-l-A*(r,CD-)-o)o) . 


(3.30) 


Since the wavcpacket representation is centered about tOo (and the conjugate about -coq), equation 3.30 can be rewritten by 
Taylor-e\pandingx(r.o)) about the central frequency o)u (and -tOo) 


^/(r.O)) = I 


^ [m-too]' d'XM(r,o)) 

X,r(r,a)o)4X—- 


.V=l 


(i)=(i)o 


Ai.(r,0)-0)o) 


(3.31) 




X7*(r-“o)+ S 


[o)-t-u)o]- 9'X;ii(r,o)) 


.v=l 


0a>' 


l(D=-(Oo 


A,.(r,a)-l-())o). 


Considering now only the linear polarization near the center frequency coo, and defining co' = 0)-0)o, the polarization is inverse 
transformed into the time domain 




, W a-Wro)) 

X>t(r’“o) + X “TT 


.v=t 


do' 


103=0)0 


Ak{r,(n')e-“^dvi 
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[coT' 

si 3o>' 




= Y + S Ji 




Here, the function of the operator is interpreted by its Taylor’s expansion about the central frequency coo- 

Using equation 3.32, the vector Helmholtz equation 3.11 can be rewritten for the quasi-monochromatic envelope 






A,(r,r)e-'“o'+ 5^ / r,coo +^ 


d'^P^+ir,t) 


with the corresponding conjugate expression. Here means the nonlinear polarization associated with the phase factor 

j,-»v y\n additional equation is required to describe the evolution of any additional radiation well separated from coq, such as 
the third-harmonic, otherwise the fractional bandwidth of a single envelope would exceed unity and the functional representa¬ 
tion of the linear material polarization would become invalid. 

Defminu the linear dielectric tensor 


eUr.wjs l+Xjkir,o)}, 


the vector Helmholtz equation becomes 


*€{.r.y.:} 


The second-order time derivative of the linear polarization is evaluated 


= i ( fl: -0)5! k (r.coo + /|:) A,(r.r)] je 




In general, the linear dielectric tensor is composed of 9 independent elements. For the purpose of this thesis, the material 
symmetry clas.ses will be restricted to those in which the linear dielectric tensor is isotropic. Isotropic refers to the case when 
all three elements of this diagonalized tensor are equal [225], i.e. 

E(to) 0 0 

E(a)) = 0 £(o)) 0 , (3.37) 

0 0 e(a)) 

otherwise the material is anisotropic. The linearly isotropic symmetry classes are cubic and isotropic. Note that the isotropic 
symmetry class is not a crystal class but a macroscopic description of liquids, gases or disordered or amorphous solids such as 
glass. 

The linear propagation and absorption functions are defined 
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where )t(r,(o) is the frequency-dependent linear propagation constant and a(r,a)) is the frequency-dependent linear intensity 
absorption coefficient, and both are allowed to be functions of space to represent material inhomopneity. The tensor notation 
is dropped for the linearly-isotropic classes because the dielectric tensor becomes scalar. Assuming that linear absorption is 
small compared to the propagation constant, equation 3.38 can be approximated 


'7 

lt"(r,(o) -I- ife(r,co)a(r,(o) se -j [e^(r,a))-f/e/(r,0))], 


where the following definitions are made; 


e^(r,(o) =!^e{e‘'(r,(i))} 
e;(r,(o) = Im {e^(r,(o)} . 


Now the linear propagation and absorption constants are written 

A-(r,co) = —eK(r,to) 

C" 

rvtw, tt\\ - - (i\\ i. 


a(r,co) = -eV(r,(o)/yeL (r,co). 
With these replacements, the Helmholtz equation is now written 

’-A ').-- - i ^ 


(3.42) 


A ( r,coo-l-i^ ) k' ( r.Oo-t- /^ ) -1- icx I r,coo-l-1^ 1 Ay(r,r) 


where only the form of the nonlinear polarization remains to be specified. 

3.1.2 The Third-Order Nonlinear Material Polarization 

This .section examines more closely the third-order material polarization, beginning first with the general response of nonlinear 
refraction. Usinu the same steps as for the linear material polarization, the third-order response integral is transformed into 
the temporal frequency domain. The frequency-domain representation is approximated for finite bandwidth excitation and 
transformed back into the time domain. Using the results of Appendix A, the tensor structure is reduced for the isotropic (and 
cubic 432. 43m and m3m) symmetry class. Finally, the fully general case is reduced to a form suitable for the description of 
the nonresonant Kerr and Raman nonlinearities as previously discussed, which are used throughout the remaining chapters of 

this thesis. i • • ia 

Following the procedure used for transforming the linear response integral in section 3.1.1, the nonlinear polarization 3.16 

is written in the temporal frequency domain as 


p'j^\r.(n) = ecJ J 


£,„(r.tU.-|) 5 (W-C 0 l -0)2 -tO3)<iC0iif(O2</C03, 

where the factors of 1 /2n have been neglected for simplicity, and the third-order nonlinear susceptibility tensor is defined 

X;i./m( 0 );r,a)i , 02 , 0 ) 3 ) S J J y^©(ti)©(t2)©(T3)/?j*/m(f.tl,t2,'t3)x (3.46) 

The Heaviside functions are included to enforce causality with infinite limits of integration. Again, because the third-order 
nonlinear response function Rju,,, is real, 

Xyiimlw; r, -0)1,-0)2,-0)3) = x}i:/„,“I’“2>(03). ( 3 - 47 ) 

In this thesis, the nonlinear processes of interest are those which are nearly frequency-degenerate, such that any generated 
frequencies are near that of the driving field at 0)o, and those involving the third-harmonic. These latter third-harmonic processes 
are discussed in Appendix A. 
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Using the quasi-monochromatic representation 3.21, equation 3.45 can be written 

1 ^ 4 (r, 0)1 + coo)A, (r, 0)2 - (Oo)A„, (r, 0)3 - coo) 

+At.(r, 0)1 - o)o)^i (r, 0)2 + o)o)A„,(r, 0)3 - Wo) 

+A*(r,Wi —a)o)A;(r,W 2 —0)o)A,„(r,W3 + Wo)j x 
5(0) — coi — CO 2 — C 03 )<ia)i £/(02^co3, 

which is valid for fully-dispersive nonlinear refraction. Now, the third-order susceptibility tensor is Taylor-expanded in the 
three frequency space [Wi, W 2 , W 3 ] about the center frequency Wq and its conjugate -O)o. Taking the first term in equation 3.48 
for example 

Xjklm - COo)Am(c 03 — (Oo) = (3.49) 

X>Wm(“o;-“ 0 ,“ 0 ,f«)o) + l“l +(^] 

(i)j=-a)o 


-1- [w: - Wo] 


-I- [W 3 - Wo] 


tOo,W 2 ,Wo) 




+ ••• X 


Ai.(Wi-1 -Wo)A/(W2-Wo)A,„(W3 -Wo), 

where the spatial dependence is dropped for notational convenience here and throughout the rest of this section. 

Usine the proper Taylor expansions for the remaining terms in equation 3.48 and transforming back into the time domain 
results in (cf equation A. 10) 

^ (0 = Y ^- Wo -f / ^ . Wo -t- / ^, Wo -I- / ^ ^ (r) A; (f) A,„ (r) (3.50) 

+X7i/mfti3o;wo-l-/^.-W()-l-/^.Wo-l-/^)Ai(r)Ai (r)A„,(t) 


+X;t/m^Wo;too + /^.Wo-l-(^.-Wo-l-/^jA|.(r)A;(r)A]'„(r) . 

The operator arguments are interpreted as before and the derivative operations are only performed on the corresponding field 
envelopes; thus, ordering of the field envelopes is important. Using the intrinsic and spatial symmetry properties discussed in 
section A.2. equation 3.50 reduces to (cf equation A. 11) 

/’I(0 = ^ [x; 7 ) 7 (^“o• + + i^'^Aj(t)Aj(t)Aj{t) (3.51) 

-h 2XyMy f Wo; - “0 + ' 4 • fk) + '■ 4. “0 + '■ 4) ^ (0^; (0 


+ Xy 7 afwo;-Wo-l-i^.Wo-i-/^,Wo-l-i^)A^(r)Ai;(r)A/:(r) e “"®', 


where the subscripts j ^ k. 

In general, as given by equation 3.46, all terms in the Taylor’s expansion of the third-order susceptibility about ±Wo have real 
and imaginary parts. This is a result of retaining the fully dispersive character. As mentioned previously, this thesis is concerned 
with the instantaneous and Raman responses. Even though the instantaneous response strictly means that the susceptibility does 
not vary over the excitation bandwidth, this thesis retains both the real and imaginary parts of the instantaneous susceptibility 
without considering the fully dispersive character. The reason for doing this is because the delayed response given by the Raman 
effect should dominate the remaining dispersive terms. In contrast to the instantaneous susceptibility, the frequency dependence 
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of the Raman susceptibility is retained such that the even-order terms in the Raman susceptibility expansion are real (giving 
rise to nonlinear refraction effects) while the odd-order terms are imaginary (giving rise to nonlinear gain/absorption effects). 

The full derivation including both the instantaneous and Raman nonlinear response is carried out in section B.3. The result, 
given by equation B.78, is repeated here for convenience 

+2x%t j («o; -t»o- (Oo, coo) ^ (r )A*(r )Aj(r) 

+ J { [ + xW“o;0):4;(/)At(r)] Aj(t) 

eo [Lr 3:4;.(r)Aa0lT,. ^ 

- 4 \ - dt -^ dt J 


EO 

8 


R [ dAl(t)Aj{0 -(Uor 

dr dr j J 


(L7(“o->0) Iz -+0) 








The third-order polarization includes scalar and vectorial (coupled) forms of nonlinear refraction, two-photon absorption, first- 
order Raman gam. and second-order dispersion of Raman nonlinear refraction. The phase-dependent vectorial terms are absent 
it orthogonal tmcular polarizations are chosen as the basis set rather than linear polarizations [226]. Here, the linear polarization, 
or Cartesian, basis set was chosen because these polarizations are also the eigenvectors for a planar waveguide. 

Delinine the Kerr nonlinear index, the two-photon absorption coefficient, the Raman nonlinear index, the Raman time 
constant, and the Raman index dispersion coefficient, respectively. 


«X = {x5^yy7(‘'^);-“0-“0.OJo)} 

(3.53) 


(3.54) 


(3.55) 

^ ^JJji 

^ ~ X;;;;(00(i;0) /IX + 

(3.56) 


(3.57) 


and the nonlinear anisotropy ratios 

Ak = y^kkjly^li) “ ^jkkj/Kjjjj — '^ikkj/'^Jjjj 
Yx' = y^jkkly^iii yr = y~jikk/K-jjjj—^jjkJ^jjjj' 


equation 3.52 reduces to 


f'>) = eo«o«;{[|^7(Ol'+2A|/l,(f)|-]A,(f)-fY^(r)A*(r)Ajt(r)}e '“o' 
4 - {[ lA;( 0 r + 2 Aa' |A*(/) 1 " j Aj(/) 4 -YxAj(r)Ai.(f)Ai(/) 


eo"o";tR 


dA;(r)Aj(/) ^ aA;(r)A*(f) - 

- dt - 
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where the total instantaneous nonlinear refractive index and the overall nonlinear anisotropy factors are defined 

/i2 = nAT + '^r? (3.59) 

^ + r^/? [A/? + Y/?] (3.60) 

^ 2 [/2 a: + «/?] 

»A-YA- + r2/?Y/? (3 51) 

/ 2 a: + r 2 /? 

Here, /ia' = [1 - //?]«:• "« = /«"2’ /« represents the fractional Raman contribution to the ultrafast nonlinear index. Note 

that for the isotropic symmetry class, 

2 Aa + Y;c = 1 

Ar + 2 ya= 1 , (3.63) 

and under Kleinman .symmetry, Aa =Ya!: ^/? = Yfi- j j 

The form of equation 3.58 is suitable for the multiple-scales analysis of section 3.2 because the terms are easily ordered 

based upon peak intensity and pulse duration. For the nonlinear Helmholtz equation of this section and the numerics of 
Chapter 4 though, the full Raman response can be used. The time-domain form is 

p(-^)+(j)_ (3.64) 

eo«o«A-{ [ |A;(t)l'-f2AA ITitrir] Ay(/)+YA^(0^r(0-4i(0}t"'‘'^^ 


- {[ \Aj{t)\- + 2Aa 1/U(0|-] A;(r) 


+ R'jjjj{-^)A'j(t-x)Aj(t-x)Aj(t)dx 

-f Aa R'jjjj(T:)Al(t - T)At(/ - x)Aj(t)(lx 

In the numerics, the convolution integral with the Raman respon.se lunction is easily evaluated in the temporal-frequency domain 
using instead the Raman susceptibility. 

3.1.3 The Fifth-Order Nonlinear Material Polarization 

The direct extension of the nonlinear polarization to fifth-order is. in the frequency domain. 




XJllmnJ OK tl) 1. CO;, (1)3,0)4,0)5 ) X 

£4 (0) 1) £( (o);) £,„ ((O3) £„ (0)4) £„ (0)5) X 

5((0 - CO] - 0 ); - 0)3 - (O4 — 0)5)cf(Di<f(0;rf0)3rf0)4rf0)5. 


Neglecting nonlinear dispersion and using the quasi-monochromatic wavepacket, the fifth-order nonlinear polarization at (Oq 
responsible for nonlinear refraction, written in the time-domain, is 

Pf*{t) = 


(3.66) 
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+ ••• 




where there are 5 C 2 = 10 total terms. 

From Appendix A, the fifth-order nonlinear refraction for isotropic symmetry is (cf equation A.23) 

= X ( 3 - 67 ) 

J 16 

|/l^(r)|%(r)-f ^ |A*(r)l"A;(r)-f ^ |A,(r)|-|A*(r)|%^^^ 

+1 |Ai.(r)|-Ai-(/)A‘(r) ^Af(t)Al{t)Aj{t) + +iAj(r)A:.-(r)A;(r) 

+ l\Ak{t)\-Aj(t)A)(t) + jAlHt)Aj(t)Aj(t) + j |A*(r)p|A/(r)|-A;(f) , 


where | =4 / and Kleinman symmetry has been invoked for simplicity, leaving only one independent component of the sixth- 

rank tensor [226]. Analogous to the cubic nonlinear index /i; and two-photon absorption coefficient P 2 . the quintic nonlinear 
mdc\ and ihree-phoion absorption coefficients are defined 


/I 4 = 

I 6 /I 0 

P 3 = • 


(3.68) 

(3.69) 


Note that the fullv-dispersive fifth-order Raman response is not considered. In fact, the zeroth-order retractive part of the 
fifth-order Raman response can be assumed to be included in the definition of / 24 , where Xjjjjjj redefined to include both the 
instantaneous and Raman contributions. 

Tsine the qumtic nonlinear index and three-photon absorption coefficient, the fifth-order nonlinear refraction polarization 
is final I \ written 


6 


X • P --' 


/J L 




(3.70) 


+ I K(0l' iAi(/)|M;(0 + j |Ai(r)|-AI(/)^(/) 

^A’f(t)Al{t)Aj(t)+^A'j(t)Al‘{l)Aj{t)-\-j\At(t)\-AJ(t)Xj(t) 


' J \- /■ J \ ^ 


5 


-io>of 


Again, ihc phase-dependent vectorial terms are included, but eventually will be removed due to the assumption of phase-velocity 
mismatch beiween the orthogonal modes of a waveguide, which washes out their effects over distances longer than the beat 
length. The pha.se-dependent vectorial terms are rigorously absent for the circular polarization basis set [226]. 

An additional, effective, quintic refraction contribution arises from the cascaded interaction between the fundamental at coq 
and the third-harmonic. This contribution will be derived in the multiple-scales analysis even though the result is valid for the 
Helmholtz development of this section as well. This cascaded contribution is the higher-order analogy of second-harmonic 
cascading, which leads to an effective third-order nonlinear refractive effect [ 193], and has been studied recently because of the 
possibildy for producing a large Kerr-like nonlinear index [ 196]. Note that an additional quintic contribution proportional to 
the square of the cubic Kerr index is obtained in the next section, and is the result of the derivation of a first-order equation. 


3.1.4 The Vector, Nonlinear Helmholtz Equation 

Using the third- and fifth-order polarizations from sections 3.1.2 and 3.1.3, the nonlinear Helmholtz equation 3.44 can be written 


V-A,(/) 




(3.71) 
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r Q 1 r 1 ^ 

-■4 + (Oo + 'V |-4j(/)|'A;(f) + 2AAr|Ai(/)p/ly(/) 

C** [ I Ot j I 

+yKAl{t)Tj{t)^ 

“o + '7j7 

+Ar J /fyyy/'t) |A;(.(/— t)| Ajdx 

+yRj^i^nd'^)Al{t--^)Aj{t-'^)Akdx 
+yRj^ /?;;7;('C)A’(r-T)A*(f-T)A*£/T 

«0M4 + '^| + |Aj(OrAj(/) + -|Ai(?)l'‘A;(/) 

+ ^|Ay(f)l'|A,(0pA,-(f) + ^|A*(/)|'A7(0^-(0 
+^0 A^f) A j (/) + jA} (/) a 7* (/) A j (/) 

+ 1 iAi{/)l"A;'(OAj(f) + ^A^'(/)A; {t)Aj{t) 

+ 5|Ai.(/)P|A;(/)|-Ay(0 . 

Equation 3.71 describes the general evolution of multi-dimensional vector waves including the effects of full linear refractive 
and absorptive dispersion, third-order nonlinear refraction, fully-dispersive third-order Raman scattering, and fifth-order non¬ 
linear refraction {neglecting the third-harmonic cascading term) and two- ancl three-photon absorption. This equation also de¬ 
scribes omm-directional and inhomogeneous propagation in that scattering from linear or nonlinearly-induced inhomogeneities 
is fully treated. In fact, the only approximations made in the derivation from Maxwell’s equations lie in the material polariza¬ 
tion. and the neglect of frequencies far removed from the fundamental coq which could be generated by a nonlinear interaction 
such as third-harmonic generation (which will be treated in the multiple-scales analysis). 

The vector nonlinear Helmholtz equation has no known analytic solutions, although approximate stationary solutions to 
reduced versions may be obtainable, for example for a scalar equation with truncated linear and nonlinear dispersion [139]. 
These approximate solutions must be numerically propagated in order to test for stability. Therefore, what is most useful for 
the soliton and .solitary wave propagation and interaction studies of this thesis is the nonlinear evolution of an initial vector field 
distribution into the far field (of linear propagation). As a result, equation 3.71 must be treated as an initial value problem rather 
than a boundary-value problem, which can only determine the equilibrium solutions, if any. Evolution over time is the most 
natural, but also the mo.st difficult because the nonlinear Helmholtz equation is differential in space and integral in time due to 
linear and nonlinear dispersion. 

One numerical method to evolve a unidirectional wave by a second-order differential equation is the Nystrom method [227], 
which is analogous to Runge-Kutta for first-order differential equations. Nevertheless, second-order equations such as 3.71 are 
difficult to solve numerically for problems where long time/distance evolution is desired. The main contributing factor is the 
leading-order scales, which leads to restrictive requirements on sampling. This difficulty plagues numerical techniques that 
directly solve Maxwell's equations, such as finite-difference time-domain (FDTD) [228], as well, and motivates a different 
approach in which a first-order, vector nonlinear evolution equation for the slowly-varying field envelope is utilized. This 

equation is derived in section 3.2. . 

The Helmholtz equation retains the fast variation due to propagation, which is approximately For wide angular 
spectra and backscattered radiation, this phase must be properly sampled. The largest propagating longitudinal or transverse 
wave number is ko = 2n/X, where X is the material wavelength, and v = 1/X is the largest propagating spatial frequency. 
The upper bound on sampling is given by the Nyquist limit, meaning that in order to properly sample the highest spatial 
frequency, the samples must be at X/2 intervals or less. Two additional considerations must be made. The transverse wave 
number for evanescent fields is larger than Ac, and the linear interference between forward- and backward-propagating radiation 
produces fringes with spacing X/2. Therefore, the spatial sampling must be X/4 or smaller in order to Nyquist sample these 
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oscillations. As a result, sampling at X/10 is generally considered a rule of thumb for linear problems using a second-order 
accurate numerical scheme [229]; for nonlinear problems, sampling at X/100 is commonly used. When nonlinear evolution 
into the far field is important (as it is here), it is difficult to formulate an accurate numerical technique that can resolve both 
fast and slow scales, because these fast scales dominates the calculation and any variation occurring on a slower scale becomes 
difficult to detect. 

Much work has been done on a technique known as homogenization, in which the fast and slow scales are separated before 
the numerical calculation. The multiple-scales analysis performed in the next section is a form of homogenization, where the 
mean propagation phase and group velocity are removed in the development of a first-order evolution equation. Therefore, 
the two shortest distance scales, which are order 1 and 1/E where E is a small parameter, are removed, with the leading-order 
distance scale, l/E*, representing diffraction and dispersion (which are far-field effects), and nonlinear self-focusing and self¬ 
phase modulation. As a result, sampling can be larger than a wavelength since the finest scale to be resolved is that of the field 
envelope. Of course, in the non-paraxial and non-SVEA regime, sampling still must approach X/2. The advantage here is that 
the sampling scales inversely with the bandwidth of the envelope; in the other schemes based on the second-order equation, the 
sampling is based on X, independent of the envelope. 

3.2 Multiple-Scales Derivation of the Nonlinear Wave Equation 

This section uses the asymptotic method of multiple-scales to derive the basic equations needed for the theoretical and numerical 
studies of optical solitons and their interactions presented in this thesis. Because of the numerical difficulties that the Helmholtz 
equation 3.71 poses, a different approach to describing the nonlinear spatio-temporal evolution of wavepackets is developed in 
this section. This equation is first-order in the evolution variable and possesses good scales in the sense that all fast variations 

are remtned. • j -u 

The multiple-scales analysis separates the fast scales from the slow, resulting in a nonlinear wave equation that can describe 
Iona time/distance evolution; this is especially important in the waveguide derivation of Appendix C where variation in one 
transverse direction is fast. From the derivation, the multi-dimensional nonlinear Schrodinger (NLS) equation is obtatned, 
which evolves over a distance scale I/E", while higher-order terms that evolve over distance scales 1 /e^ and I/e'^ are obtained 
in subsequent sub-sections. The point of this analysis is to remove the order 1 and 1/e scales from the wave equation, allowing 
for the use of much simpler numerical techniques. 

At each step in the perturbation expansion, an evolution equation is derived that contains all terms with significant contri¬ 
bution within some characteristic propagation distance. Higher-order equations subsequently extend the region of validity. The 
first-order equation accounts for wave propagation of the optical carrier, which is sufficient over a few wavelengths over which 
the field envelope does not change. A second-order envelope equation determines the group-velocity and is valid over distances 
smaller than the characteristic diffraction/dispersion/nonlinear lengths. Paraxial diffraction, group-delay dispersion, and cubic 
nonlinear refraction are added at third-order to reveal the multi-dimensional NLS equation. For distances well beyond the 
characteristics lengths, higher-order perturbation theory is required. The fourth-order equation adds perturbations to the NLS 
equation that describe linear and two-photon absorption and the first-order non-slowly varying envelope (non-SVEA) terms of 
third-order dispersion, space-time focusing, optical shock, and Raman gain. The fifth-order equation includes second-order 
non-S\ n.-\ terms, non-paraxial terms. Raman index dispersion, longitudinal field coupling, and quintic nonlinear refraction 
with contribution due to third-harmonic cascading. 


3.2.1 Multiple-Scales in Maxwell’s Equations 

Once again, the starting point is Maxwell’s equations, written in a form appropriate for dielectric media 


„ ^ dH 
VxE = -po-^ 

^ .. dP 

VE= —5-VP 
eo 

VH = 0, 


(3.72) 

(3.73) 

(3.74) 

(3.75) 


where the constitutive relations have been used to eliminate the electric and magnetic displacements. Spatial and temporal 
functional dependence is generally neglected for simplicity. Note that, with the use of the constitutive relations, the vector 



Final Report AFOSR F49620-95-1-0431, Spatial Soliton Interactions, S. Blair & K. Wagner, U. Colorado, Boulder 


75 


Maxwell’s equations contain six unknowns in the vector electric and magnetic fields, but separate into eight equations relat¬ 
ing those fields. The divergence equations are unnecessary, but will be retained in the analysis because they provide direct 
relationships for the longitudinally-projected fields. 

As before in the discussion of the linear and nonlinear polarizations, the quasi-monochromatic wavepacket representation 
is used for both the electric and magnetic fields 


E = -I- -h cc (3-76) 

H = -h -f cc, (3-77) 

2 2 


where it is assumed that the fractional temporal bandwidths of the fundamental and third-harmonic envelopes are each less than 
unity. The over-bar denotes the retention of a fast phase variation due to linear propagation. This fast phase will be explicitly 
removed in the multiple-scales analysis. The mean direction of propagation is chosen to be along the z axis, which is fully 
general for the isotropic symmetry class, but only strictly applies to cubic classes when propagation is nominally along one of 
the crystal axes. The third-harmonic is included in order to obtain the cascaded contribution to the quintic nonlinearity. 

With this wavepacket assumption, quasi-monochromatic Maxwell’s equations can be written in terms of the fundamental 

envelopes A and B, 



{3.78a) 

(3.78b) 

(3.78c) 

(3.78d) 



(3.79a’) 

(3.79b’) 

(3.79c’) 

(3.79d’) 


with the corresponding conjugate expressions. Here, the material polarization is separated into linear dispersive and nonlinear 
parts. The fundamental and third-harmonic envelopes must satisfy the above equations in order that the total electric and 
magnetic fields E and H. as given by equations 3.76 and 3.77 respectively, satisfy Maxwell’s equations. 

Key to the multiple-scales analysis is the suitable choice of small parameters that serve to scale the slow variables. The 
slow transverse variables are defined X =Kx,y = Ky. which are responsible for diffraction. The small parameter K ~ X/wq < 1, 
where uo is a measure of the beam width, guarantees that the width of the transverse angular spectrum Mr ~ 2n/wo is small 
compared to the linear propagation constant Icq = 27t/X. otherwise known as the paraxial assumption. In linearly isotropic 
media, only even-order powers of the small parameter K occur in the derivation, and the standard paraxial approximation 
truncates the diffraction expansion at K* resulting in the parabolic representation of the ;:-space surface. In this derivation, the 
expansion w ill be taken to order K^, which goes one step beyond the paraxial approximation. 

The slow time variable is defined T = r]t, where q ~ Aw/wo < 1 is the fractional temporal bandwidth. The small parameter 
q allows for the representation of linear and nonlinear dispersion by Taylor expansion in terms of time derivative operators 
instead of convolution integrals, as developed in section 3.1.1. Dispersion effects appear at all positive powers of q. For 
example, writing the linear dielectric constant in terms of real (dispersive) and imaginary (absorptive) parts 


(3.80) 
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the following expansion can be made 


/ ^ \ 3 Tl“ n 


(3.81) 


^4 


_ , _E^ (0)0) ^ + —((Oo) ^ • 

+ ;e-\;(coo) - (foo) ^ + • • •, 

where E is a small parameter that scales absorption and is defined later. If the envelope possesses fast temporal variations about 
the carrier (Oq, i.e. greater than unity fractional bandwidth, then the slow variable T can not be defined and the dispersion 
expansions are not valid. The slowly-varying envelope approximation (SVEA) truncates this expansion at rj-, while the non- 
SVEA development here extends the expansion to r)"* as shown. The imaginary part of the dielectric constant responsible for 
linear absorption is scaled three-orders smaller than the real part in order to appear in the derivation as a first-order correction 
to NLS. This is justified by noting that the development of the nonlinear polarization assumed that the optical frequencies were 
not in (ine-photon resonance with the medium and by the fact that only materials with small linear absorption are of interest 
for .soliton switching, as discussed in Chapter 5, which amounts to the requirement that absorption is small over the diffraction 

length, which is the length scale of NLS. • 

Weak nonlinearity is assumed such that wave propagation is linear to lowest order. This i s ensu red by redefining the 
einelives as small - vA and vB for the fundamental - where the small amplitude parameter v ~ scales the material 

pokirizaiion expansion. Since third-order nonlinear effects contain the product of three fields, nonlinearity will enter the wave 
equation two orders higher than first-order linear effects, which depend on only one field. Using the results of sections 3.1.2 
and .' 1.3. the nonlinear polarization with time dependence centered about the fundamental frequency (Uq is 

Pyv^Eo«o«21[ |Aj|‘-f2A|Aj(|‘j(3.82) 
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where jz^kril. The terms responsible for two-photon absorption are scaled one order smaller than the refractive parts so that 
iw 11 -photon absorpiion also appears one order beyond NLS. This is also Justified by noting that the development of the third- 
order nonlinear polarization assumed distance from two-photon resonance. As shown in Chapter 5, it is desirable for soliton 
switching (and true for nonlinear optical switching in general (115)) that two-photon absorption is weak relative to nonlinear 
refraction, which appears at the order of the NLS equation. Note that the term responsible for three-photon absorption is 
allowed to be the same size as fifth-order nonlinear refraction, however. The reasons for this are discussed later. The last term 
describes third-harmonic downconversion, and the form is derived in section A.2.3. 

The pertinent nonlinear polarization at the third-harmonic is (c.f. equation A.16) 

^ [xW3“o;Wo,Wo,(OoMy;4yAy (3.83) 

+ 3Xjav{3tOo;(Oo.“Oit»o)'4ji:Ai.Ay]. 
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The third-harmonic electric field S generated by this nonlinear polarization is assumed not to be present at the beginning of non¬ 
linear propagation, but is generated by the fundamental through a non-resonantly phase-matched process. For a single incident 
linearly-polarized wave (at (Oq), the only susceptibility tensor component that will produce a third-harmonic polarization wiAin 
the limit of the fifth-order derivation presented here is the scalar Xj jj ;(3Wo > ‘‘^o) • For other states of incident polarization, 

or for the interaction between orthogonally-polarized waves of interest here, more components are required as shown. 

The slow evolution scales are defined Z/ = e‘z, where propagation is assumed to be nominally along the z-direction and 
powers of e determine the scale of the overall variation of the envelope due to diffraction, dispersion, nonlinearity, and absorp¬ 
tion. At each order / + 1 in the analysis, a differential equation in Z; is obtained which groups all envelope variations of size 
e'"'" ‘. To this end, the envelopes are expanded in power series in e; 


A = A|-1-EA2 + e-As-f---- (3.84) 

S = £” Sj • 

B = B|-l-eB2-he-B3-t- 

T = e-T3-!-•••. 


The hisher-order terms in the expansion represent corrections to the initial envelopes Ai and B|. Each step in the analysis forces 
the lonsi-distance behavior (at Z,) of the envelopes such that equation 3.84 remains a uniform asymptotic expansion. The scaling 
for the third-harmonic S (in which the first contribution appears at the third-order of the analysis) is set to be consistent with 
s-eneration by the product of three fundamental envelopes Ai. The third-harmonic interacts with the fundamental to produce a 
ca.scaded fifth-order nonlinearity [211] of sign and magnitude which are adjustable through phase-matching. Fifth and higher 
odd-order harmonics could also be generated, but do not affect the nonlinear wave equation for the fundamental at the order of 
the present derivation and are therefore ignored. 

The small parameters, k for the fractional angular bandwidth, i) for the fractional temporal bandwidth, v for the fractional 
change of the dielectric constant due to nonlinearity, and e as an overall scaling factor, determine the relative scaling among 
terms (and physical effects) in the perturbative expansion. The perturbation expansion follows a particular connected path 
through the three-dimensional lattice of terms indexed by powers of K, r\, and v; higher-order terms (more distant from the 
origin) are always smaller. A particular physical situation determines the sizes of the small parameters relative to E, and allows 
for the derivation of the appropriate evolution equation. For example, a nonlinear spatio-temporal wave of short duration and 
large transverse size will result in the scaling k = i)- = v- E ev The terms associated with diffraction will then be driven to 
hicher order and the resulting evolution equation at third-order perturbation theory, or E-\ will be temporal NLS. Subsequent 
orders f* and higher will reveal non-SVEA terms (and paraxial diffraction at ep. Conversely, a spatio-temporal wave of long 
duration and narrow transverse size is described by the scaling t) = K- = v- = £-. moving the ultrafast terms to higher order and 
yielding the spatial NLS equation at E^ after transforming into a ctMirdinate system to remove the group delay. Again, further 

analysis will reveal non-paraxial terms (and group-delay dispersion) at £ . 

The most general equation, retaining as many terms as possible under the initial assumption of weakly nonlinear propagation 
of a field envelope, occurs when all physical effects manifest themselves over commensurate distances of propagation and 
results from the scaling K = ft = v E £. One result of this condition is that the multi-dimensional nonlinear Schrodinger 
(NLS) equation is obtained at order E"^. Any other choice of initial scalings, such as the scalings valid for the numerical 
simulations presented later, would lead to one or more effects being less important than the others, resulting in the paraxial or 
SVE approximation, as previously discussed. In terms of the slow variables, the time and space derivative operators become 


a a 


dt 

V 



= .t— + V;^ + - 
dr ov 




d: 


dz 


-he 




(3.85) 

(3.86) 


where Z)) is replaced by z so that uppercase represents slow (envelope) variables and lowercase represents fast (propagation 
phase) variables. In the analysis, Zq represents zeroth-order perturbation theory with characteristic length scale on the order of 
the carrier oscillation, Z, is first-order perturbation theory with length scale of group delay, Z 2 is second order resulting in NLS, 
and so on. Note that now the field envelopes are functions of z and the slow variables, i.e.. 


A-> A{X,K,Z,->o,T,z), 


(3.87) 


where the fa.st z dependence will be removed in the analysis. 
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Now, the quasi-monochromatic Maxwell’s equations for the fundamental can be written, in component form. 




ar ac az, 


d 9 . 1 -r 


az az, 


y : ——h -f ... At ” £ ^ v 


dAy dAx . r . ^ 

'w"“»+''§F *=■ 


(3.88a) 


(3.88b) 


(3.88c) 


which is Faraday’s Law and relates the electric and magnetic fields such that a time-varying magnetic field serves as a source 
for the orthogonal rotation of the electric field. 


. 91- ra , a , 


9 L / 9 ^ 'T I ^ p(Nl-)+ 

-lEo O)o + IE^ e + 


9 9 1-5 9B;_ 


-(£0 too + /E3:r e^(wo + 'e^)Av + — 


lAj +lp(NL)+ 
arJ^-^+eo^-^ 


dBy dBx 


(3.89a) 


(3.89b) 


(3.89c) 


.• 3 \ T , 2_-5(NL)+ 


-/Eo + e P + 


w hich is .Amperes Law (modified by Maxwell) and states that a time-varying electric displacement drives orthogonal circulation 
of the magnetic field. 


arA 9x ai' [az az, 

2 f ra a 


Eo 1 




dY [dz az, 


which IS Gauss’s Law and constrains the longitudinal electric field polarization in order to guarantee that the electric displace¬ 
ment IS divergence free, and finally 

E 

which slates that there is no isolated magnetic charge and ensures the magnetic field is divergence free by proper choice of 
longitudinallv-ptilarized magnetic field. 

These equations .serve as the starting point for the next section which presents the derivation of the (3 -e 1 )-D vector nonlinear 
wave equation for the Cartesian polarization components. This choice of the polarization basis set is arbitrary. The reduced 
dimensional derivation appropriate for nonlinear propagation in a planar waveguide is presented in Appendix C. In this case, a 
(2-^l )-D vector nonlinear wave equation results because of one dimension of linear confinement by the waveguide. Again, the 
Cartesian polarization components are used, but now this choice is not arbitrary since the TE and TM modes of the waveguide 
appear in orthogonal linear polarization states. Slight modification of the above set of Maxwell equations is necessary in the 
guided case since, in one dimension transverse to the mean direction of propagation, the material polarization depends on 
position, and the envelope in that dimension may no longer be slowly varying. 


dBx 9Bv 
dX 


dB- dB- 


3.2.2 Derivation of the (3+l)-D Higher-Order Vector Nonlinear Schrodinger Equation 

The propagation of and interaction between optical solitons is of primary interest to this thesis. The use of one field envelope 
A does not preclude the description of multiple soliton interaction with arbitrary states of polarization. Indeed, a first-order 
nonlinear wave equation will be derived for each polarization component in Cartesian coordinates, and any number of soliton 
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envelopes of arbitrary polarization state can be decomposed into this basis set. For example, for the interaction between two 
solitons of orthogonal linear polarizations, the initial conditions would be 

Au=MX,Y,0,T) (3-90) 

Aiy=fyiX,y,0,T), 

where and fy represent the envelopes of the individual solitons. Using solitons of orthogonal circular polarizations, 

/i„ = ±[A(x,K,o,r)+/_(x,K,o,r)] (3.9i) 

,,, = -;=[/+(X, r. 0, r) -/_ (X, F, 0, r)] , 


Mx.rAT}^-^ (3-92) 

/_(X,K,0,r)^^ (3-93) 

represent the soliton envelopes of right- and left-hand circular polarizations. 

As a result of the choice of optic axis, z, the wave equations for the x and y polarizations will be identical in form due to 
the isotropic and cubic symmetry. Now the terms are grouped in powers of e. At all orders. Ax and Ay satisfy similar equations, 
while A; is different due to linear vector coupling in Maxwell’s equations and the fact that propagation is along the z direction. 

Order £ - Phase Velocity 

At the lowest order. Maxwell’s equations constrain the phase velocity. Writing out the coefficients of e, in Cartesian form. 


' . - 
x: =-ipoOioBix 

dz 

y ■ iv 

z: 0— 

which allows the electric fields to be determined from the magnetic fields, 

. dB]y [ . 

•r: -^ = /eoa)oel,^(mo)Ai.v 

y: =-/eoa)oe^(tOoMh■ 

;: 0 =/eo(Ooe^(too)Ai;, 

which allows the magnetic fields to be determined from the electric fields, and 


(3.94a) 


(3.94b) 

(3.94c) 


(3.95a) 


(3.95b) 

(3.95c) 


which provide no new information. 

It is immediately seen that the longitudinal electric and magnetic field polarizations are zero to this order, leaving only four 
non-zero field components. This is expected since the projection of the electric polarization onto the propagation direction is 
smaller than the transverse projections because of the initial choice of scalings. These scalings dictate that the fast propagation 
variation is along z and the transverse variation (i.e. diffraction) is described by the slow variables X and F. 

Taking the z derivative of equation 3.94b and substituting equation 3.95a, the x-polarized electric field equation is written 

=-eoiUo(05e^((Oo)Au 

A\x = A\xe‘'^' 


(3.98) 
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where Aijt is a slowly-varying envelope, £oPo — 1/^' c is the speed of light in vacuum. The + sign is chosen here for 
forward-going waves, and backward-going wave are neglected b^the^sumpdon of weak nonlinearity, which forces wave 
propagation to be linear to this order. The same relation holds for Aiy, and Biy. The wavepackets travel in the z-direction 

with some mean wavenumber ko, or propagation constant Icq = resulting in the mean phase velocity Vp - 

(Do /ko for the underlying carrier. 

Using the expansion 3.84, the relevant equation at lowest order is 


= -koAx+ O(e^), 


(3.99) 


which determines the phase velocity. The order e* term emphasizes the fact that this equation is only accurate to order e. 
Equation 3 99 allows the removal of the fast phase variation such that subsequent analysis will reveal approximate evolution 
equations for just the slowly-varying field envelope. Note also that, since A,, = 0, the total longitudinal field A, = 0(£-), which 
will be determined in the following section. 

Order £■ - Group Velocity 

The previous order determined the phase velocity of the underlying wavepacket carrier, this order determines the group velocity 
of the wavepacket envelope. Faraday’s Law (equation 3.88a) results in 

, 9A->v 9Aiv o , dBijc 


* 9^42v _ t; 


while Amperes Law (equation 3.89a) results in 


dB2\ 95 iv l / \t 




y : -)- = -ito(not.j^((no)A2y 


(3.100a) 


(3.101a) 


(3.101b) 


+£o E 


:^(cDo)-i-(0o£^(coo)j ■ 


Finall\. in order to ensure divergence-free fields. 


^^l.v ^Az- _ 
dX dY dz 

dBu dBzz _ „ 

ax dK dc 


(3.102) 


(3.103) 


Here, the divergence equations are used instead of the c-component of the curl equations because the former provide direct 
relations for the longitudinal fields. 

Performing the same steps used in the previous order, the equation governing A^ is 


2£^((Oo) + (Do£^(u)o)] 


(3.104) 


The homogeneous solution to the left-hand side of equation 3.104 is 

Ai,=A2,e''^% (3-105) 

such that the right-hand side, which has the same phase factor, is secular. In order to suppress unbounded growth of the 
particular solution for Ai, (which would be « ce'^o"). and therefore ensure that equation 3.84 is a uniform asymptotic expansion, 
this secular term must be set to zero, resulting in 


2//:o^^ = 2£!j^((Do)-l-(OoE^(Wo) 


(3.106) 
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Noting that 


2ko/i = 2^Je^{m)l-Mo^o) + 




ret'(Wo) 


= 2^e^(wo) + ^e^(wo), 


equation 3.106 can be rewritten 




(3.107) 


(3.108) 


(3.109) 


which is the lowest-order evolution equation for the slowly-varying field envelope and has error of e, which represents higher- 
order corrections that will be obtained in subsequent sections. Equation 3.109 well describes wavepacket propagation over 
distances short compared to the dispersion/diffraction/nonlinear lengths, and subsequent analysis will increase the accuracy of 

the approximate envelope wave equation by successive orders of £. i • « 

Making the substitutions/!^ = eA.,, d/dz = ed/dZi, and d/dt = td/dT, the overall evolution equation for the .v-polarized 

field envelope is currently _ 

^ + A'^ = 0(e^), (3.110) 

which is accurate to order e-. The error will be reduced to order e'* by deriving the NLS equations in the next section. Equa¬ 
tion 3.110 has the traveling solution 

AAx,y,z,t) = A,(x,y,z,t-k^z) (3-111) 

where k' is the group delay coefficient and Vg = I/Aq is the group velocity coefficient. The solution reveals that to this order, 
the en\ elope simply translates along the direction of propagation at the group velocity without change, which is clearly valid 

within the spatial and temporal confocal ranges of propagation. 

Based on the derivation thus far, the fractional temporal bandwidth small parameter can be more formally defined. The 
group-velocity appears one order of e later than the phase velocity, which implies the following relation 




(3.112) 


which, in the temporal frequency domain, is rewritten 

|AwAo>4| 


■ e|M| => e ~ Awp 


Since the group delay is defined 


WoE‘^(Wo) 

kk =- 1 - / ^ 

^ 2cyE'^(coo) ^ 


(3.113) 


(3.114) 


which is appro.ximately the phase velocity, it is seen that E ~ |Aw/wo| (more properly, tj ~ |Aw/wo|); therefore, the small 
temporal scaling parameter is the fractional bandwidth. For large fractional bandwidth approaching unity, more terms in the 
Taylor expansion of the dispersion need to be retained to properly describe dispersion over the pulse bandwidth. 

Using equations 3.102 and 3.103, the longitudinal field envelopes are written 


A^. = — 


( rd/!|, , 3/!iv 


= — 


Ao L ax 

i rasit, aBiv 


kr>[dx aF J 
i ra^u a/iiy 


(3.115) 


(3.116) 


-pooxildy dx ^ ^ 

The over-bar is removed because all components travel with the same phase velocity Vp. Again, substituting the amplitude 
expansions, the total longitudinal electric field is 


1 raA^ aAv 


Ao I ax ay 


(3.117) 
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The slowly-varying longitudinal electric field envelope is driven by transverse derivatives of A;, and and thus arises from the 

(paraxial) projection of the X and y polarizations onto the z axis. 

Order - The Multi-Dimensional NLS Equation 

The multi-dimensional spatio-temporal nonlinear Schrodinger (NLS) equation shows up at this order in both the x and y po¬ 
larization components of the electric field. The third-harmonic is also generated. The terms from the quasi-monochromatic 
Maxwell’s equations are 


dA^- 3A3v 


5r“ a?" 


TH; ^ = 
dz 


(3.118a) 


, dAif dAzx . 9Au 3A2; . -h ,, 


TH: 

t\K ihc clcciric field envelopes of the fundamental and third-harmonic, and 

dB2z dB^v ^B2\ dB[y 


dr dz dZj dZ. 


ieotoo e^{a)o)A.-,., -i-/ e^(coo) 


dAzx 3^ , 2 -^3)1 


' 111" + — 

2 dr- to " 


+eo^ E!i^(wo)Ai, + »e!^(too)^r 


(3.118b) 


(3.119a) 


TH : g = 3/eoCi)o6!)^( 3(00)*^3.1 


dB^x dBzx dB\x dB2z _ 


, , . 1 -, ,9A2v E^(“o)a-Ai, , 2-^i) 

-/6o(Oo ej^((Oo)A3v + »Ej^(t'>o)'0^” 2 dT- "^eo • 


+Eo^ E^(tDo)A2v + /E3^(a)o)-^ 


Til : ^ — = —3iEoCOoel;^(3u)())5,Ti — b/cOoP,.^, 


(3.119b) 


iic magnetic field envelopes. The final equations from the divergence of the electric displacement and magnetic field are 


e\h (coo) 


dAzx dA2x ^>^3: , 


(3.120) 


+»El((a)o 


'A 1 4- -I- 


dx^ dr 


TH: ^ = 0 
dz 


dBzx dB2\ dB^z , dB2z _ ^ 


(3.121) 


TH: 
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where an immediate simplification of equation 3.120 results by substituting equation 3.102 

0)^ 


The third-order polarizations are written 


(3.122) 


^^^ = eo/io«2 |'4ul''4u + 2A|/liy|'^ij:+VAi.v^uJ (3.123) 

=eO/IO«2 |Aly|'Aly-t-2A|Al.t|'i4|y-|-Y^lj^lj'] 

^,3 = yXjx«(3(Oo;mo,a)o.a)o)^u + (3.124) 

>^^3* = ^Xxc«(3(Oo;COo,(Oo.COo)a|v+^X nyv(30>o;Mo,(Oo.O)o)AJy4iy. 

Here, the nonlinear polarizations consist of third-order nonlinear refraction for the fundamental and the source terms for the 
third-harmonic driven by the fundamental. 

As before, the equation governing A., will be developed first. Taking the z derivative of equation 3.118b and substituting 
equation 3.119a results in 


"a^ +■*"3?r"*“3r= ■'‘"IT 


(3.125) 



dB^, 


d 

\ "aT^ ■ 

azj 

” az,' 

~^dT 


)Ait + {(^o) 


,a.4iv , 2_(3) 

+/ 80 CO 0 2 37 - 


which, after substitutions from order can be rewritten 

f2j 5’Ait 9 ~^!.v 

+ A:o'^3.t: - 32. ax* dXdY 

1 f t aMiv ^‘'Aiv 

-iko +'^^>>0 — 


(3.126) 


dfi 2 \ _ Ik) ^ 4 1 , ^ r 


- e<) ((Do) + woeIj^Iojo) 


-(£0 [e!^( 0)0) -h ^e!{ (0)0) + 2io)()i^/' 


Setting the secular term to zero results in 


. . 3A|, d-A|, d-.4i, o)o f , , 0)0^■'^1' 

-“^si^nr^syr - ? i=k(“o) + ifr 

, fd.A’, 36;vl 952v 

+ 2 p() 0 )o/’, =-ikt -^+ 1 ^ 0 -^ -</JoO)o-^ 


(3.127) 




Using the relation dAu/dZ^ = -l^dAuldT and substituting from equation 3.100a, the right-hand side of equation 3.127 

can be written _ _ 

[SA;, ,dA2,l t»>o[E:^(‘')o)] d~A\x (■) P«) 


(3.128) 


Noting that 


-kokl = - ^ - 




4 £L (o)o) 


(3.129) 
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substituting equation 3.129 for the right-hand side, and writing the nonlinear coupling in full, equation 3.127 reduces to 


(3.131) 


(3.132) 


[5^1, ,9^1^ aAu] d-Au I (3.130) 

-|-2;t5—[l/lu|-Au-h2A|Aiy|'Au + YA?v>llJ = 

Using the amplitude scaling relation Ax = sAj ^ + ^“Azxy the substitution 

A-eA + c2_L (3.131) 

9; “ 9Z, ^ 9Z,’ 

the slowly-varying envelope equations 3.109 and 3.130 can be written in lab coordinates 

^ , 3Ajc ,/3A,rl d^Ax d'Ax , n 139 ) 

2 //:o + ^ + 

-l-2)t5^^ [lA,,r-A,-h2A|A,pA,-hYAXl = 

no I ^ 

.At this point in the analysis, the full evolution equation for the field envelope is valid to order e^. Equation 3.132 is easily verified 
by making the substitutions into the slow variables and amplitudes and comparing with the envelope equations obtained in this 
and the previous order. Note that there are extra terms containing Ai^ and A 2 v that have not been obtained in the analysis thus far. 
These terms, for example d-AiJdX-, 2 A|Au|-A 2 t, and Y/^5,>4I.t. are all of order e'' and higher and fall beyond the e- accuracy 
of the current state of the derivation. These terms will pre.sent themselves in the next sections as the accuracy improves. 

The leading-order distance evolution scale Z, « 1 /e of equation 3.132 represents the group delay, or group velocity. This 
scale can be easily removed with the Galilean transformation T = t-k^z, with the resulting leading-order scale (Z 2 « l/e‘) 
repre.sentmg diffraction/dispersion and self-focusing/self-phase modulation. This is a principal advantage of the first-order 
equation over ihe Helmholtz equation. 

When the coordinate transformation T = t — k^z is made and Ay = 0, equation 3.132 is the (3-^*i)-D scalar NLS equation. 
Combined wwh the corresponding equation for Ay, equation 3.132 represents the coupled or vector NLS equations. The next 
seciu)n u ill determine the first-order corrections to the vector NLS equations. Since there is no coupling from A; to A., or Ay, 
or linear ci^uplmg from A, directly to Ay to this order, the coupled NLS system is a fully vectorial one for isotropic and cubic 

media. n u u 

* The transverse and nonlinear scales become apparent at this order. The terms obtained at this order are £“ smaller than the 

terms obtained at order E. Therefore, ,, ,, 

l^ + ^Le-i^Aj. (3.133) 


Bv transforming into the Fourier domain, the small parameter can be written 




= sinG. 


(3.134) 


This relation indicates that the transverse momenta must be less than the propagation constant ko in order that K < 1, as shown 
in Figure 2.2 for k,. When k > 1. evanescent wave propagation must be considered. The transverse FWHM momenta are 
related to the width of the wavepacket by kjc,ky ~ 2n/uo. where circular symmetry is assumed. This relation gives the scaling 
K ~ Because sinG = k^^/ko < 1, the exact isotropic A-relation k: = - A; - A; is reduced to the paraxial expression 

A- % Ao - k-/2ku - A 7 / 2 A 0 . as given by the spatial Fourier transform of the linear part of equation 3.132. 

By the same arguments, the nonlinear scale is written 

v = e~,/^lA,l, (3.135) 

V ”0 

w hich means that the nonlinear polarization expansion is well-ordered and converges when v < 1. This occurs when the induced 
nonlinearity is weak compared to the linear index of refraction, as indicated. 

Performing the same steps with the third-harmonic envelope results in 


-A-( 3 a)o)S 3 .- 


ir(3COo)Ai^-j-3Xi»<jr(3tOo)'4iyAij; , 


(3.136) 
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which is recognized as a wave equation driven by a third-harmonic source term. The wavenumber for the third-harmonic is 
defined it^(3(0o) = 9cOoe^(3ci)o)/c^. Because the driving polarization has the phase factor e so must the third harmonic in 
order for this equation to have a solution. The solution is then 


Xx„^(3coo;coo,t0o,(0o) [Af^ + 3AArAf^.Au 
9kl-k'^(3(ao) ^ 


(3.137) 


where Ak is the cross-phase modulation coefficient for the Kerr nonlinearity. The resonant denominator indicates that the 
strength of the third-harmonic grows near the phase-matching condition /:(3c0o) « 3fco (or n(3cOo) ss /i(Wo)), but the unphysical 
pole is a manifestation of the fact that the derivation is not valid at the resonant condition, for which initially exponential growth 
of the third-harmonic would occur. The pole is not a problem in practice because the phase matching condition is difficult to 
achieve in condensed, linearly isotropic media. In fact, for useful nonlinear dielectrics with low loss, it is typically true that in 
the optical regime, /i( 3 ©o) > «(wo)> which has important ramifications to multi-dimensional stability. 

Using equation 3.122, the longitudinal electric field envelope at this order is 


i [dAi , dA 2 x , 9A2 v 

[dz7 ^ 'W 


(3.138) 


Combining the equation for the longitudinal electric field envelope with equation 3.115, the total longitudinal field envelope is 

+ (3.139) 

ko [ d.v dy J dr [ dx ay _ 

The form of the longitudinal field will be elaborated on in the next section. 


Order e"* • The non-SVEA Corrections 

In this section, the first set of higher-order corrections to the NLS envelope equation are obtained which allow for the description 
of the propagation of short pulses. These corrections are important in studies of propagation over distances greater than the dis- 
persion/nonfinear lengths and are: linear and two-photon absorption, third-order linear dispersion, space-time focusing, optical 
shock, and Raman scattering. The evolution of the third-harmonic is not considered in this section because any corrections 
obtained here to the solution S 3 of the previous order will not enter the final analysis. At e"* then, the terms from Maxwell s 
equations are 

nt-i. 9 A 3 ; dA 4 v dA 3 v dA;, dA|y _ (3.140a) 


(3.140b) 


3^3: 

3A4v 

3 a 3v 

3A2v 

dA|y _ 


3K 

dz 

3Z, 

dz. 

az, “ 







dB^x 






-pOdT 

dA4,r 

3A3v 

3Ai, 

9Au 

3^3- 


dz 

3Z, 

dZ. 

dZj 

dX 







dBjy 





ipi)0)oB4y 

~^^~dT 


for the electric field envelope, and 


dBi- dBiy dBjy dB;, dB|v _ 

■ 3 ? ^ ~ ~ 117 ~ dZj ~ 

I , .— I'/ \^A3j( (^ 0 ) d^Ait 

-/eotoo e5^(tOo)A4,t-l-iej^(o)o)-g^:-2- dW 




,2^4) 


dA2x £x'((^o)d^Au . 2 


(3.141a) 


+eo^ E^(wo)A3^ + te^((rto)-^ 


2 372 eo ^ 
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. dB4, . dB3, , dBz, , dBi, 

^ ■ "aT az, dZj ax 

nT , .vL' 


(3.141b) 


-/eocoo 


■‘■^ar 


1 — I' / \ 

e^(o)o)44v + jej^icoo) -2-^ 

(o3o) a^Aij, \T _L ^ 

6 3r T+'MO)oMly+-Py 

,dA2y e^'(coo) a^]4iy , 2-^3) 

eo ^ 


e^(a)o)A3v + 'e3t(“o)-0j^ 2 dT^ 

for the magnetic field envelope. The final equations are from the divergence of the electric displacement and magnetic field 

(3.142) 


dA^r 3A3v dA^- dAy dAo- 

-aF+#+ir-^^^a^ 


'eoe^ (too) 




ap 


+ 


ax ar 


■+ 


Bz 


I4\ I 9g3v BB 4 . agj; dB 2 z _Q 

(4) • :sv + av + a, + 32| ^ aZj 


(3.143) 


(3.144) 


ax ay ‘ az 

The third-order polarizations are written 

= EO«0«2 { 2 A |v4iv|“j A2 z+Y ^1jc + ^1>’ 

which IS the nonlinear polarization along the z-axis due to coupling between the transverse and longitudinal field, and 

2|>ilx|'^lr + Ai^.A2x+ 2A|/\iy| A2x-\-^A^A2x (3.145) 

4“2AA 1 1 y “h 2A4 |\-A2 vA \x 4* 2yA [yA2yA 


Px = £o«0«2 


-f-/— \A I” l.r “H 2Aa' 1-4 1V j l.r “h YAT^ h*^ lx ] 
2k f I ' 


-Eono'*2'l^/?' 




Au+yR 


^A^yA\x aAi^iy 


BT 


BT 


Aiv), 


with a similar expression for . The first set of terms in the transverse nonlinear polarization consist of mixed field envelopes 
from different orders of the field amplitude expansion because Az, can be substituted for one Ai.^ for example, giving an extra 
factor of E. The second set of terms represents two-photon absorption while the last set represents the first-order approximation 
to Raman gain. 

Following the .same procedure used in the previous sections, the condition on the x polarization component at this order is 




dA3x , 3A3., 


BZ, 


B'Azx ^‘Azx 1 

”01^“ azr 




BT 


baz, a-iAst 

■ az, ax- 

Wa I • > 


(3.146) 


e!)^(a)o) 4 2a)o£i)^(o)o) + *^£^(^ 0 ) 


9“A2r 

*3^ 




elj^Ccoo) -l-a)oe^'(too) -t- y 


az, “azidz; c- [ 

2 2-5(4) 


a^A 




ar3 


apl^> 


- ikft‘i((ao)Au- —kfPx -4/>io(Oo-^. 

Upon further manipulation and the use of equation 3.130, the secular term becomes 
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(3.147) 
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where the third-order group-delay dispersion coefficient A-'q" is 


^ e^(coo) + tooe^(a)o) + ";^£^ (^>^o) 


Once again using the amplitude scaling relation 


and the c scale relation 
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the full slowly-varying envelope equation to this order is written 

raA^ ,aA^i a"A.v d~Ax , iu^Ax 


(3.148) 
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where u<i = 's the linear absorption coefficient and K = P2/2^o«2 is the normalized two- 

photon absorption coefficient [119], The other additional terms to NLS are third-order dispersion, with coefficient k!^, and the 
so-called .space-time focusing [144] term. The space-time focusing term is one of a more general class of terms denoted here 
the space-time dispersion terms, and describes the (paraxial) curvature of the energy front due to spatio-temporal diffraction. 
Followine nonlinear refraction and two-photon absorption, optical shock is the first-order time derivative of the (vectorial) non- 
linearitv and uives rise to an intensity-dependent group delay. The optical shock expression obtained here has two contributions, 
one directly from Maxwell's equation in the time derivative of the nonlinear polarization (proportional to phase delay), and one 
due ti> the removal of the S VEA by one order (proportional to group delay). The last term, proportional to the time derivative of 
the (vectorial) intensity, is the first-order (linear) approximation to the Raman gain/loss function, with time constant Xr defined 
by equation 3.56. and results in amplification of low frequencies at the expense of high frequencies. 

Solving equation 3.142 for A 4 : using the fact that A 4 ; has the same propagation phase as all other field components, the 
contribution to the longitudinal field at this order is written 


i pA?; 3/42; ^^3y 

dx ^ dY _ 

2i [a^ d^ ^ 

'^"^eoeliltoo) 3X aK dz 


(3.152) 


Combining this equation with A2z ‘ttid A^z^ the total longitudinal field is 
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(3.153) 
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/ \d^A, d^A, i [ aNL?> 8 n4^^ ' 

2kl dx^ ^ dxdy- dy^ dv9jr-J ^o«0 dy 



where the nonlinear terms are defined 

NLi^'=«:[lA.y|-A^ + 2A|A^|-A;y + YA;A;] (3.154) 

NLt^’=/)2 |Av|-Av + 2AlAy|-Av + YAX . (3.155) 

There is a simple geometrical interpretation for the origin of the longitudinal field. Considering only linear propagation for 
the moment. Figure 3.2 illustrates the relationship between the projections of the polarization onto the a: and z directions given 
an angle of propagation 0 with the z axis. In the Fourier domain, the ratio between the transverse and longitudinal components 



l ieuie .3,2 Origin of the longitudinal field projection. The field envelope A is decomposed into a plane-wave spectrum A, 
with one member of that spectrum illustrated, propagating with angle 0, where sin0 = k„/ko. The direction of polarization 
Is orthi’eonal to the direction of propagation, and is decomposed into transverse (Ay) and longitudinal (A;) components. For 
simplicitv. tilt m the \ direction is not illustrated, but is a straightforward extension of the present geometry. 


IS given b> ^ 

Az= - lanGAt = 

This geometry is easily extended to include the y transverse direction^ with the result 


^ [k^A.X + kyA^,^ — 


kx^X “b kyAy 

/ k“{oyQ-\~ Aco) — kj^ — 


(3.156) 


(3.157) 


The truncated Taylor's expansion, when transformed back to the real space domain, recovers the linear terms of equation 3.153. 

Analoiious to the projections of the transverse electric fields, the longitudinal field is also driven by the projections of the 
transverse nonlinear polarizations onto the :-axis. The paraxial projections appear in the nonlinear terms of equation 3.153, 
with additional contributions from the nonlinear coupling between the longitudinal and transverse fields. 


Order - The non-Paraxial Corrections 

The results of this section are derived mainly for the purpose of obtaining the form of the propagation equation, including 
all commensurate terms through the fifth-order (or quintic) nonlinearity, which will be used later in the thesis as an ultrafast 
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saturation mechanism to stabilize multi-dimensional solitary-wave propagation. The fifth-order nonlinearity obtained here 
consists of three distinct contributions: directly from the fifth-order polarization, from the product of the third-order polarization 
with itself, and a “cascaded” contribution. The second contribution is a result of the reduction to a first-order equation and the 
cascaded contribution arises from the interaction between the fundamental and third-harmonic. 

This derivation also illustrates the non-paraxial and analogous higher-order non-SVEA terms. The terms additional to 
the quintic nonlinearity which show up are then; dispersion of linear absorption, fourth-order linear dispersion and the non- 
paraxial spatial diffraction analog, second-order space-time dispersion, second-order shock (intensity-dependent group-delay 
dispersion) and the spatial analog intensity-dependent diffraction, Raman index dispersion, and nonlinear coupling with the 
longitudinal field. Some of these terms will be used in the remaining chapters of this thesis as well. 

The two Maxwell’s equations that are necessary at this order are: 
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where the additional nonlinear polarization term is defined 
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(3.158) 


(3.159) 


(3.160) 


q- mixed terms, 

which are the fifth-order terms from equation 3.82. The fir.st set of terms in the nonlinear polarization is the direct contribution 
from X'^’. while the second set of terms is the cascaded contribution from the interaction between the fundamental and third- 
harmonic. The third group of terms is the third-order nonlinear coupling with the z component of the electric field, the fourth 
is the Raman index dispersion and the last represents the mixed terms involving Ai;c, Azx, etc, which are not specified for 
convenience. 

Using the same method employed before, the fol lowing secular equation for the evolution of the slowly-varyingx-component 
of the electric field envelope is obtained: 


+ dr- 


(3.161) 
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The third line from the end contains terms with spatial derivatives of the third-order nonlinear polarization. The first three of 
these terms are vectorial resulting from the divergence equation, while the next two terms result from the reduction to a first- 
order equation. All of these terms correct the curvature of /.-space (which is now slightly anisotropic) due to nonlinearity, termed 
here intensity-dependent diffraction. The next line contains first-order shock terms associated with two-photon absorption and 
Raman scattering, and the second-order shock associated with nonlinear refraction. The last line contains an additional term 
which IS the result of the reduction to a first-order equation (discussed next, which contains further non-SVEA and non-paraxial 
nonlinear contributions) and the cascaded and direct quintic nonlinearity. 

Evaluating the remaining Zz derivative term results in 


^7 ^ 


i-^-^--ikfnz\2\Au 
Zquo dZ-, L 




(3.162) 


-h2M,,A,v-^ + 2Al4,,|-^-h2YA,,^,,-^+YAT,-^J, 

which contains effective quintic nonlinear terms from the product of the cubic nonlinearity with itself and additional intensity- 
dependent diffraction and second-order shock terms. Keeping only the effective quintic nonlinear refraction terms 

eo/io oZt L"oJ *■ 

-1-2 [2A-l-Y^] |A|,,|‘ |A|v|'Ai.,- f y[ 2A-|- l]AJ-Ai^.Al,v 
+ y[ 2A— l]AY,4i-4i.t-l-2Y|A|y| |, 

which IS similar in form to the direct contribution from x'^'- 

This contribution can be more easily understood intuitively by considering the total scalar relative dielectric constant 

e = tr (|A|‘j = «o -I- 2/10/1; |A|' -f 2 / 10/14 lA]'*, (3.164) 

keeping terms only up to fourth-order in field. In a second-order equation, such as Maxwell’s equations or the Helmholtz 
equation, /r is the relevant quantity for wave propagation, while in a first-order equation, n is the relevant quantity. Using the 
appropriate Taylor's expansions, 

„(|A|-)r«„o + /,;|A|--f / 14 -g |A^ (3.165) 

which has the direct /14 contribution from x*^’ and an indirect contribution manifested by the expansion of the square-root as 
required in the first-order equation. 

The ca.scaded interaction between the fundamental and third-harmonic produces another effective fifth-order nonlinear 
interaction. Evaluating the third-harmonic down-conversion terms of the fifth-order polarization given by equation 3.161 

,-casc j. . T 9 

2k ^—— |Ai,|‘*Ai4-1-3AAi’YA'|A iv| Ai4-I-6Aa;1Ai;(| |Aiy| Ai^ 

Wo >■ 

+ 3AA-A|;AYyAi.t -hYArAYiAjYAu + 2Ak jAiyf AfyAjj^ j, 


(3.166) 
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where Ak and Ya: are the cross-phase modulation and vector four-wave mixing coefficients for the Kerr nonlinearity (the third- 
order Raman susceptibility cannot produce a third-harmonic) and the cascaded quintic nonlinear index is defined 

casc_ ^ Xxx;cr(3(Oo; (Op, COp, a)o)Xxcxx(tno; 3(1)0, —(Op, —tOp) (3.167) 

'^32^0 n2-n2(3(Oo) 

This is the same form as the effective n 2 produced by the nonresonant interaction between the fundamental and second- 
harmonic [194] mediated by which has received recent interest [196]. Resonant third-harmonic cascading has also recently 
been studied [211], with results away from the phase-matching condition similar to that derived here. 

Writing out all three contributions to the vectorial quintic nonlinearity results in 
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(3.168) 


which will be used in the final higher-order NLS equation. Note that the coefficients of the intrinsic quintic nonlinearity are 
only strictly valid for the isotropic symmetry class under Kleinman symmetry such that A^r = Y/c = 1/3 must also be used for 
the cascaded contributions. 

Three-photon absorption, with coefficient P 3 , is included in this study for completeness and to allow for resonant enhance¬ 
ment of the the quintic nonlinear index. The use of three-photon enhancement would violate the Kleinman symmetry assumed 
in deriving the cross-phase modulation coefficients of the fifth-order susceptibility, however. In three-photon resonance, the 
third-order susceptibilities responsible for third-harmonic generation and down conversion will also be enhanced. The imag¬ 
inary parts of these susceptibilities would need to be considered as well, and is a straightforward extension to the present 
derivation. In this situation, the third-harmonic cascading process would also produce an additional, effective, three-phomn 
absorption coefficient Three-photon resonance is not used in this thesis, but this discussion is provided as an illustration 
of one means of enhancing the saturation effects of quintic nonlinear refraction. 


3.2.3 Reduction to (l-hl)-D Spatial and (2-1-1 )-D Spatio-Temporal Equations 

As mentioned previously, the scalings used for the derivation of the (3-i-l)-D spatio-temporal nonlinear wave equation are not 
the same as those that are valid for the numerical simulations performed in this thesis. In fact, the result of the derivation is a far 
more general vector equation than necessary for the simulations. This section reduces the full equation to simplified equations 
that are more easily dealt with numerically while still describing the necessary physics. 

In both cases, linear confinement in the y dimension is provided by a slab waveguide structure, as shown in Figure 1.20. 
A detailed analysis of the influence of the waveguide is presented in Appendix C up to the order of the NLS equation. For 
simplicity, these results are not used directly here, but some simplifications are used that are the result of the discussions in 
Appendix D. which makes direct use of the waveguide equations derived in Appendix C. The main simplification is that all 
phase-dependent terms in the nonlinear polarization are neglected under the assumption that the difference in phase velocity 
between the TE and TM modes results in a beat length much shorter than the wave interaction distance. 


The (l-^l)-D Vector Spatial Nonlinear Wave Equation 

In this situation, which describes the propagation and interaction among purely spatial solitons, the temporal derivatives and 
they transverse spatial derivatives can be neglected. Also, by setting e = 0.1, the diffraction and nonlinear small parameters 
satisfy K = £, and v = e*. These scalings are appropriate for the numerical studies of Chapter 5. Note that in the (1 -h 1)-D case 
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of spatial solitons, quintic index saturation is not necessary for stability and will not be included because the expected intrinsic 
value of for the example material is too small to produce any noticeable effect. 

Using these scalings, the appropriate evolution equation is 

... dAj, . .. . . d-Aj, 1 d^Ax (3.169) 


dAr d^Ax 

2ik()-^-{- ikQCtoAx + 0^2 




+ {[1 + iK] |A,|-A, + 2 [A + IKAk] |A,|-A,} = 0, 


with the corresponding coupled evolution equation for Ay. The first term in equation 3.169 is the first-order propagator. Linear 
absorption is next, followed by paraxial (second-order .v derivatives) and non-paraxial (fourth-order x derivatives) diffraction. 
Third-order nonlinear refraction and two-photon absorption including scalar and vectorial contributions (neglecting the phase- 

dependent vector four-wave mixing terms) follow. 

Stimulated Raman scattering is not present in the cw case (although the Raman contribution to the instantaneous nonlinear 
index is retained), and it is assumed that spontaneous Raman scattering, which has not been discussed, is negligible. Note that 
the linear non-paraxial term is retained, while the nonlinear non-paraxial terms are neglected. This is justified by the fact that 
V = e-, which places the nonlinear non-paraxial terms order e- smaller than the linear non-paraxial term. 

The (2+1 )-D Vector Spatio-Temporal Nonlinear Wave Equation 

Like the previous case, here one spatial dimension is neglected due to transverse confinement by a slab waveguide. Therefore, 
the terms with v derivatives are not included, but all temporal derivatives must be retained, subject to the small scaling parame¬ 
ters. .Acain, setting e = 0.1, the scaling of the small parameters appropriate for the simulations of Chapter 6 are: k = x] = e, 
V; = E-' and V 4 ='e. where V: ~ |/i 2 //iol'/'|A|, and V 4 ~ |n 4 /nol’^'‘|A|. The first two scales indicate that the non-SVEA terms 
are more important than the non-paraxial terms. The temporal scaling parameter is larger than the spatial parameter because 
.short puLses are necessitated in this thesis by the desire for short propagation distances combined with the small anomalous 
group-delay dispersion of available materials such as fused silica. The nonlinear scalings indicate that the cubic nonlinearity is 
proportionally weaker than the quintic nonlinearity. Therefore, the quintic nonlinear terms will dominate over the second-order 
non-S\’E.-\ nonlinear terms, and the latter can be neglected. Even though the strength of the Raman nonlinearity is about the 
same as the instantaneous cubic Kerr nonlinearity, the spectral bandwidth of the spatio-temporal waves used in the simulations 
exceeds the optical phonon resonance frequency. Therefore, a three-term approximation to the Raman response is insufficient to 
correctly describe intra-pulse stimulated Raman scattering over the entire spectral bandwidth. For the simulations, it is assumed 
that the full Raman re.sponse enters at order with the Kerr nonlinearity, as discussed in Chapter 4. 

Using the scalings appropriate for the simulations then, the higher-order terms included in the reduced equation are the quin¬ 
tic and R^iiman nonlmearities and those terms that are the result of the removal of the slowly-varying envelope approximation 
by one order: third-order dispersion (TOD), space-time focusing [ 144], and optical shock (183]. The fourth-order dispersion 
(FOD) term is also used, but the other second-order non-SVEA terms need not be included, as discussed above. In the reduced 
time coordinate frame, where T = t- k'^z and Fq is the group delay at the center frequency (Oq. the (2-i-l)-D vector nonlinear 
evolution equation takes the form 
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This equation is referred to simply as the higher-order, multi-dimensional vector MLS, or vector hNLS, equation. 

The terms in equation 3.170 are, in order: propagation in the reduced-time coordinate frame, linear absorption, transverse 
paraxial diffraction, group-delay dispersion, third-order dispersion, space-time focusing, fourth-order dispersion, nonlinear 
self-/cross-phase modulation (SPM/CPM) and self-/cross-two-photon absorption, self-/cross-Raman amplification, vectorial 
optical shock (two lines including nonlinear phase modulation and Raman amplification), Raman index dispersion, and qumtic 
nonlinear refraction (last three lines). The higher order non-SVEA terms of third- and fourth-order linear dispersion, optical 
shock and Raman scattering were described in more detail in section 2.4, while the spatio-temporal terms are discussed in 
section 4.1 and linear and two-photon absorption are discussed in section 5.3. 




Chapter 4 

Spatio-Temporal Diffraction and the Split-Step 
Numerical Method 


The nonlinear evolution equation derived in the previous chapter can be solved with a variety of numerical techniques. In fact, 
the purpose of using the multiple-scales analysis was to remove the fast scales in order to reduce the complexity of the numerical 
method This is in contrast to the vector nonlinear Helmholtz equation, which is second-order and possesses variations on the 
order of an optical wavelength and requires more complex methods for numerical solution. 

The principal methods in use today to solve first-order partial differential equations are the finite-difference [212,230] 
and split-step methods [231,232]. The finite-difference method becomes cumbersome when dealing with multiple dimensions 
and hiL’her-order derivatives, such as those required for third- and fourth-order dispersion and non-paraxial diffraction. This 
chapter examines the more aesthetic and intuitive split-step method. The split-step method solves, in alternating steps, the linear 
diffraction problem in ifs natural Fourier domain and the nonlinear, or inhomogeneous, problem in the real-space domain. 

The split-step method is illustrated schematically in Figure 4.1. The prototype problem to be solved is propagation of an 
initial Held through a volume with inhomogeneous (spatially-varying) dielectric constant. The split-step method can be applied 
to quite general problems where the inhomogeneity is due to refractive, diffractive, or layered structures, but this thesis con¬ 
centrates instead on inhomogeneity due to nonlinearity. Ignoring Fresnel reflections, the effect of an inhomogeneous dielectric 
constant on a propagating field is the accumulation of a spatially-varying complex phase. Instead of accumulating this phase 
continuously throughout propagation, the split-step method applies the phase (representing the inhomogeneity throughout a thin 
slabi at discrete points. Between these points, the field is propagated as if the medium were linear and homogeneous. The key 
characteristic of the split-step method is this alternation between linear diffraction and inhomogeneous phase accumulation, but 
It should be noted that the method, like the underlying first-order differential equation, is limited to unidirectional propagation 
and cannot be used when the longitudinal variation of the inhomogeneity is strong enough to produce significant backscattered 
radiatu'n 

Section 4 1 describes the exact spatio-temporal linear diffraction problem for isotropic media and examines the linear spatio- 
temporal terms in the higher-order NLS equation. Section 4.2 derives the split-step method and discusses the application to 
( U1 i-I) spatial and (Z+fi-D spatio-temporal propagation. Finally, section 4.3 covers issues related to the implementation of 
the .spin-step methiHl. w ith particular focus on the accuracy of the algorithm and how it scales with step size. 


4.1 Linear Spatio-Temporal Diffraction 

This section examines spatio-temporal propagation using the linear, scalar Helmholtz valid for homogeneous, isotropic media. 
Under these conditions, equation 3.44 can be written 

|^ + Vf4+it-(coo-b/|:)A=0 (4.1) 


where the V [V • Aj term is identically zero (because now V • D = 0 implies that V • E = 0) allowing for the consideration of a 
scalar equation. Vf is the transverse Laplacian. and A is defined 


£(.v,y. 





(4.2) 
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Inhomogeneous Propagation 



Split-Step Method 

linear homogeneous steps 

AZ/2 AZ AZ AZ AZ AZ/2 



a5 "'* ' 'az'' '* ' az'■■■‘■■■■az""* ■ az 


inhomogeneous steps 

Figure 4.1: Representation of the split-step method. The top figure shows the desired propagation problem with index variation 
throughout the volume. The bottom figure shows the approximation used in the split-step method. The volume is divided 
into slabs of thickness Ac < T with the inhomogeneous index in each slab reduced to a spatially-varying phase applied at the 
discrete positions along the mean propagation direction (represented by the heavy vertical lines) located at the midpoint of each 
slab. The field is propagated between these midpoints as if the medium were homogeneous. 
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using the quasi-monochromatic representation as before. 
The spatio-temporal Fourier transform pair is defined 


E[kx,ky,z,Ui) = j J E{x,y,z,t)e-‘^'^'r’^^-'^UrTdt 


where kr = kJ + kyV represents the transverse spatial frequencies and rr = xr-i->'>> is the transverse position vector. The 
spatio-temporal Fourier transform of the electric field becomes 


E{k:c,ky,z,(n) = ^ [A{k,.ky.z,(i)-(iio)+A‘(kx,ky,z,(ii + (iio)] ■ 


An equation will be developed that describes the evolution of the Fourier components with propagation distance, using the 

transverse spatio-temporal Fourier transform of equation 4.1. ■ -n • 

In chapter 3. the two leading-order temporal scales were removed from the evolution equation. The carrier oscillation 
is removed by the use of the quasi-monochromatic representation, while forward-going propagation at the group velocity 
is removed by the coordinate transformation T = t- k^z. Using this transformation to the reduced-time coordinates, the 

wavepacket becomes _ 

A(x.y,z.t)-^A{x,y,z,T-k^z). (4-6) 

Then, by the Fourier shift theorem, A -> The evolution of the Fourier components of the wavepacket in reduced 

coordinates is then ^ ^ 

_ [»; + J;] + t:(A0, + = 0, (4-7) 

which, upon application of the second derivative with c. becomes 

S +2,A0)tS^ + [»’(A10 + 01,,) - k; - i; - A0,7tJ=] 4=0. (4.8) 

OZ- OZ 

liquation 4.S can be solved by assuming a solution of the form Aic) = i4(0) exp(Yc), thereby removing the fast propagation 
phase and producing a quadratic equation in y 


+ 2/Aa)A:QY + [a**(A o) aX)) — kj ^ ] — 0. 


with the familiar solution 


Y = ^ I —2/A(jl)A^ ± yj [2/Aa)/:Q] “ 4 [A- ( Aco -h (Oq ) -* k“ — k;. — Aco-Aq“] | 

= ± / yjk'^ (Aco + (Uq ) — — k^^ — /AtoA'o ^ 

This solution includes both forward and backward going waves (with the backward going wave traveling at twice the group 
velocity with respect to the reduced frame), but since it was assumed there is no inhomogeneity, these waves are uncoupled. 
Therefore, the initial conditions can be chosen such that only a forward going wave (the “+ sign) need be considered. This 
is the crucial step in reducing the second-order differential equation into a first-order one and can be done exactly for linear 
homogeneous problems. 

The forward-going solution is written 

A{k,.ky,z.Ao)) = e'^(/:^,A'v,0,Aa)), (4.11) 

where /4(0) is the transform of the initial field distribution in the transverse coordinates and time and 

(Ao) -h coo) - A'“ - kj - AcoAq (4.12) 

is the c-directed momentum of a particular Fourier component in the reduced coordinate frame. In equation 4.12, the full linear 
phase is accounted for in dispersive, non-paraxial propagation. 
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Equation 4.11 shows that linear propagation can be described by appropriately evolving the phase of each spatio-temporal 
frequency component of the wavepacket. The complex amplitude A (A:;,, ky, 0, Am) of the wavepacket for each transverse spatial 
and temporal frequency is known by specifying the initial field distribution for all time on a plane orthogonal to the direction 
of propagation. Each point in the three-dimensional Fourier space ik„ky,Aa)) has an associated longitudinal wavenumber 
as prescribed by equation 4.12, which is multiplied by propagation distance to determine the proper phase for each spectral 

component. 

A plot of the z-directed wavenumber is shown in Figure 4.2, where 

k, = y/kHA(n-i-(Oo)-k^ ( 4 - 13 ) 

in laboratory coordinates for one transverse spatial frequency, k,. When L becomes imaginary, i.e. when k, > A:(Am + mo), 



Figure 4.2; Spatio-temporal momentum space. This surface is calculated using the parameters of fused silica. The center 
frequency of 3770 rad/ps corresponds to a free-space wavelength of 0.5 pm. 

the associated Fourier component propagates with large attenuation. Here, k; is set to zero for these evanescent waves for 
clarity. As shown in the figure, the radius of curvature of the momentum surface increases with frequency. The transformation 
to reduced-time cixirdinates removes the first-order slope of the surface, A.q = /."^(mo), where 


A'(mo) = 


and results in the removal of a constant group delay, as shown in equation 4.12. 


4.1.1 Spatio-Temporal Group-Velocity Space 

The momentum surface given by k- can be approximated about the carrier frequency (Do by 

k,{k,,Am) ss k,(k, .0) -I- A(nk',(k„0) -H ^k”(k„ 0), 




and k”{kx,0) 


d^kz{kx,Vi) 

90)2 


(4.15) 

(4.16) 
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thereby giving the j-directed group-delay and group-delay dispersion as a function of transverse spatial frequency. Taking the 
first and second derivatives of equation 4.13 with respect to frequency results in 




\f^0 ~~ 


kpk’o 

kz(k.,0) 


(4.17) 


;"/i _ ^'o^'o k-kpIcQ k-^Q 

k,[kr, 0 )- • 


Noting that k- = cos 0 = A-.v/ tan 9, the approximate momentum space can be written 

, Aw) w Aq cos0-f- 


AwA-q Aw- r sin-0An-l 


cos0 2 


COS0 cos^ eAoJ' 


(4.18) 


(4.19) 


This approximate expression yields some important insight into phenomena (termed here space-time dispersion effects) ob¬ 
served in the numerical simulations of Chapter 6, as now discussed. 

Since the group velocity = 1 /Aq, from equation 4.19. the z component of the group velocity is 


v^;(A,,,0) = V 5 COS 0 (4.20) 

which is also known as space-time focusing [144]. Space-time focusing refers to the change in the •-projected group velocity 
with the tilt anale of an off-axis plane wave component within the angular spectrum of the pulse and describes the natural 
curvature of the energy front in the spatio-temporal diffraction of a narrow pulse. The 2 component of the group velocity is 
plotted in Figure 4.3 versus transverse spatial frequency and wavelength (k = 27tc/w). The group velocity as a function of 



Figure 4.3: Spatio-temporal group velocity. This surface is calculated using the parameters of fused silica. The dashed curve 

separates the regions of anomalous and normal group-delay dispersion. 


spatial and temporal frequency is given by the expression 


A':(Aji w) 
A(w)A'(w)’ 


(4.21) 
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where co = Ao + coq. This expression is the inverse of the generalization of equation 4.17, which gives the group-delay at the 
central frequency coq. Note that in Figure 4.3, the group velocity is plotted versus wavelength instead of frequency. 

Off-axis, the z-projected group delay dispersion coefficient *1' can be negative even when material dispersion dictates that 
/:''((!)) > 0. This occurs when 

^ kx ^ ( 0 for /:^'(co)<=0 (d7'>'\ 

^/k(o^)k^^((^/k'{(i>) for r(a))>0 ■ 

The boundary between the normal (k" > 0) and anomalous {k" < 0) regimes is shown in Figure 4.3. An effective anomalous 
regime occurs because the curvature of the group velocity surface in the spatial-frequency direction varies along the temporal 
frequency direction. When the spatial frequency curvature is large for small temporal frequencies and small for large temporal 
frequencies, as is typically the case, a region of effective anomalous dispersion can exist, as illustrated in Figure 4.3. This effect 
may have ramifications for spatio-temporal solitary waves, such that a region where k" <0 may not be necessary for pulses 
with sufficient angular bandwidth, but will not be explored in this thesis. Note that this is a spatio-temporal effect and does not 
occur for purely temporal propagation. 

For completeness. Figure 4.4 shows the z-projected group-delay dispersion surface FJ where 


k'!(k-,<a)=: 


k^juj)!-" (d)) — klk((}i)F'{(a) - k~F"(a)) 
kl(kx,(a) 


is plotted as a function of wavelength. Figure 4.4 clearly shows the enlargement of the anomalous dispersion regime due to the 



Figure 4.4; Spatio-temporal group-delay dispersion. This surface is calculated using the parameters of fused silica. The dashed 
curve separates the regions of anomalous and normal group-delay dispersion. Note that GDD is in units fs~/pm rather than 
ps-fpm. 

angular curvature of the momentum space. It is also evident that temporal broadening, which to first order is determined by 
the group-delay dispersion, varies off-axis. This is in large part due to the second-order space-time dispersion term obtained 
in the multiple-scales analysis, which is perhaps more appropriately referred to as the true space-time focusing term. In the 
anomalous regime, the projection of the wave onto the z-direction will broaden more with increasing angle, while in the normal 
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regime, the broadening will first decrease and then increase as the group-delay dispersion passes through zero with increasing 
angle. As a final note, the magnitude of z-projected third-order dispersion also changes dramatically with spatial frequency, as 
evident from the steepness of the group-delay dispersion surface at high spatial frequencies in the anomalous regime. 


4.1.2 Linear Spatio-Temporal Propagation 

This section briefly examines (2+l)-D linear spatio-temporal propagation in order to provide a graphical representation of the 
effects of space-time focusing. More detailed analysis of linear propagation is presented in Chapter 6. The initial condition for 
propagation is a transform-limited spatio-temporal wave whose shape is calculated numerically as the lowest-order eigenmode 
of the1;2-Hl)-D NLS equation, and corresponds to the a = 0 solution obtained in section 2.5, with normalized peak amplitude 
Uo = 1 •94- Using the same parameters as the simulations in Chapter 6 (i.e. wq = 39.6/im), the spatio-temporal wave has spatial 
full-width at half-maximum (FWHM) of 40.7 pm and temporal FWHM of 16.5 fs with central wavelength 1.55 pm, leading to 
fractional bandwidths of K ~ 0.01 (not to be confused with the normalized interaction angle) and ti ~ 0.1, respectively, which 
indicate that temporal (non-SVEA) effects will dominate. 

Figure 4.5 shows the result of propagation over 7.5 confocal distances under the paraxial and SVEA approximations, as 
given by the (2-i-l)-D linear NLS equation. 


dA d-A 


(4.24) 


which only describes paraxial diffraction and group-delay dispersion. The (2-rl)-D propagation geometry is illustrated in 
Fieurc 1.20, in which two spatio-temporal waves initially overlap at z = 0, with one propagating at an angle (dashed contours) 
such that the spatio-temporal frequency spectra are separated by twice the angular FWHM. This condition leads to a normalized 
interaction angle K = 2. where K is defined in section 5.1.1, and translates to approximately twice the linear resolvability angle, 
but is not exact because the waves are not Gaussian. The waves broaden by a factor of about 6.1 in each dimension after a 
distance of 7.5 2(). where the confocal distance Zq is defined as twice the distance over which the FWHM increases by a factor 
V 2 in linear propagation. Because there is no space-time focusing, the energy fronts have no curvature or tilt. 

For comparison. Figure 4.6 shows propagation under the same initial conditions but with the exact Fourier phase siven 
by equation 4.12. In this case, the spatial broadening factor is 6.0, while the temporal broadening factor is 5.9. This slight 
a.\vmmetry is expected because the governing equation is not symmetric in space and time. First-order space-time focusing is 
clearly shown in the on-axis spatio-temporal wave by the dashed curve, where the energy front is delayed (positive reduced 
times) for off-axis rays. The steepening of the leading edge (negative times) is also evident. This effect is the result of the 
interaction between anomalous group-delay dispersion and positive third-order dispersion. 

The off-axis wave exhibits more complicated structure. The mean tilt angle corresponds to a spatial-frequency shift of about 
0.07 rad/;/m. which, from the group velocity and group-delay dispersion surfaces, produces a noticeable change in propagation 
parameters as shown in Figure 4.6. In addition to curvature of the energy front, the nonzero tilt of the wave also causes a delay 
of the central portion relative to the on-axis wave. The increase in the magnitude of z-projected group-delay dispersion with 
increasing angle, or second-order space-time dispersion, is evident from the figure as well by the visible increase in temporal 
broadening. 

Because the parameters are the same as used for the spatio-temporal solitary wave simulations presented in Chapter 6, the 
higher-order linear effects observed here will be present in the nonlinear simulations as well. The present section also shows 
that the paraxial. SVE.'\ NLS equation will not adequately describe spatio-temporal solitary wave propagation for the regime 
ofmlcresl. 


4.1.3 The Linear Terms in the Higher-Order NLS equation 


This section uses the exact expression for forward-going linear spatio-temporal propagation given by equations 4.11 and 4.12 in 
the reduced time coordinate frame in order to obtain the linear terms in the evolution equations derived in the previous chapters. 
This shows the equivalence between the descriptions in the Fourier domains and in the real-space domains and leads into the 


discussion of the split-step numerical method. 

Ignoring evanescent waves, i.e. k^ + k- = kf < /.--(Aco-f coq), the z-directed momentum given by equation 4.12 can be 
approximated about kf =0 ^ 

krz « k(Am -f (Do) - ...^ ^ -r - 

" ^ 2A(Ao)-|-COo) 


k^ 


8/:3(Aa)-F(Oo) 


- Aco/:() 


(4.25) 


where only the first three terms of the Taylor expansion of the square-root are kept. The standard paraxial approximation 
keeps only the first two terms in the expansion; this approximation is made in the NLS equation. Retaining more terms in the 




Final Report AFOSR F49620-95-1-0431, Spatial Soli ton Interactions, S. Blair & K. Wagner, U. Colorado, Boulder 



Figure 4.5; Linear diffraction under the paraxial and SVE approximations of initially overlapping spatio-temporal waves with 
tihed directions of propagation. The spatio-temporal waves of initial FWHM of 40.7 ^m and 16.5 fs broaden to 247 ^m and 
99.S fs after propagating 7.5 confocal distances. The dashed line indicates the position of the energy front (defined by the 1-D 
temporal centroid) across the spatial profile, and the contours are spaced by 3 dB intensity intervals. 
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rijiurc 4,6: 1-ully non-para.\ial and non-SVEA linear diffraction of two spatio-temporal waves, one tilted with respect to the axis 
i)f propagation. The on-axis spatio-temporal wave of initial FW'HM of 40.7 pm and 16.5 fs broadens to246/tm and 97.2 fs after 
propagat'm-: 7.5 confocal distances. The dashed curve indicates the position of the energy front (defined by the 1-D temporal 
centroid) across the spatial profile, and the contours are spaced by 3 dB intensity intervals. 
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expansion beyond the first two relaxes the paraxial approximation, but evanescent modes must be neglected in order for the 
expansion to converge. In this section, the first three terms will be kept in order to reproduce on a more intuitive basis the linear 
corrections to NLS that were obtained in Chapter 3. 

Equation 4.25 still contains full linear dispersion. The slowly-varying envelope approximation (SVEA) assumes that the 
temporal variation of the envelope A is slow compared to the mean oscillation frequency coq. The frequency-domain interpreta¬ 
tion is that the fractional bandwidth q = Aco/coq, where Aco is the FWHM of the spectral intensity envelope and (Oq is the central 
frequency, is less than unity. These two measures are related by a factor called the time-bandwidth product. The dispersion 
relation only needs to be known over the frequency range in which the wavepacket spectrum A is localized, implying that a 
Taylor expansion of the dispersion relation is sufficient. Thus the narrower the frequency width (or the longer the duration), 
the fewer terms in the Taylor expansion that need to be retained. The SVEA only keeps the first three terms in the dispersion 
relation, but the first five terms will be used in this analysis for comparison with the more complete derivation in Chapter 3. 

/:(Aa) -h (ho) ^ ^ + AodAq -h - Aco'/cq -h -Acd^/co^ + — Aco'^A'o^', (4.26) 

which includes up to fourth-order dispersion. Using this expansion in equation 4.25 (which removes the mean group delay) and 
keeping terms only up to fourth power in A 7 , Ao) and their products. 


krz ^ 


iAco-X" + ^AcoX"' 


! [Aq 4" Aco Ay -r Aco^Aq] 


; - AcjO"Ao 4* - Aw^Aq' 4- 


Aco-A^f 


Aco“Ao 


The terms obtained in addition to those present in standard NLS are third- and fourth-order dispersion, space-time dispersion 
and a non-paraxial correction. 

The space-time dispersion terms from equation 4.27 can be written 


T A“ 1 1 T f 

STD = AcoA() 1 + :^ - AtoA;,-Aur -h 
L J --I 




Noting that k] = Mn’O (% 0- in the paraxial approximation) the terms are written 


STD = Aci) 


A--sin=e^^^- 

cosO X'O 


To order O ', these terms are the same as the associated terms in equation 4.19. The first space-time dispersion term (called space- 
time focusing 1144) I describes the off-a.\is variation in group delay in reduced coordinates while the second term describes the 
off-axis variation in group-delay dispersion. At this level of approximation, the region of anomalous group-delay dispersion is 
described by sinO > \ kk"/k' when k" >0. 

Concentrating for the moment only on the spatial-lrequency terms in equation 4.27 results in, 

/ - r-* 

1. _ , ’'I '‘7- 74'tm 


Figure 4.7 shows a plot of the fully non-paraxial /.-space, the paraxial approximation given by the first two terms of equa- 
tio'n 4.30. and all three terms of equation 4.30. The figure clearly shows that the three-term expansion better approximates 
the exact /-surface than the two-term expansion given by the paraxial approximation, and is valid over much broader angular 
bandwidths. 

In order to obtain the linear corrections to NLS in the real-space domain, equation 4.11 is written in differential form using 
the approximated kr; from 4.27 


— = / //(I -)- - Aa)-/:o -f -Acu^/to' + ^A(o‘*/:o” 


^0 , A( 0 -/.f 

I ^-6 


Atrrk'Q 


_ 4 \ • 

8/3/' 
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0.0 0.2 0.4 0.6 0.8 1.0 

k2(relative to ko) 

Figure 47; Illustration of the paraxial approximation. The solid curve is the exact i-space representation for isotropic media. 
The dashed curve is the paraxial approximation, while the dotted curve is the first-order correction to the paraxial approximation. 


Insersc Fourier transforming leads to 


dA 


dz 


d- 




dT^ 24 ° dT-* 


(4.32) 


-F 


2/to [ kodT /t^ar- 2kodT-\ f 


which IS rew ritten in more familiar form by making the transformation AA e\p{ikoz) to remove the fast propagation phase 

(4.33) 


") / 4 I '" ^ * i I'" ^ I ^ f- ^ 

4- V 7-/4 — Alo^o 0 ^ — ^^ 0^0 + p ^*-0 ^ y -4 

Av'r d- k'^ d-A 




_ _iL_+ 


‘/todT kldT-^ IkodT- 

which IS the same as the linear part of equation 3.170 derived in Chapter 3. 


4kl 


-0 


4.2 Derivation of the Split-Step Method 

The vector, higher-order nonlinear spatial 3.169 and spatio-temporal 3.170 wave equations can be written in the following 
noialional form 

(4-34) 

oc 

with a corresponding equation for Ay. Here, all linear and nonlinear effects are lumped into the operator Q. The forward-going 
solution to equation 4.34 as a function of propagation distance c is written 


(4.35) 
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where Aj:(0) is the initial value in space and time of the .v-polarized envelope at z = 0. 

Equation 4.35 can be examined more closely by writing the operator in terms of linear homogeneous and inhomogeneous 
parts Q = Q^ + Choosing a small propagation distance Az, and noting that can be integrated directly, equation 4.35 


reduces to 


A;,(AZ) = 


(4.36) 


The integral can be approximated to second-order accuracy in Az 

|^^0'«(zVz' = j[^|(2'”(Az/2) -b [z' - Az/2] |g'»(Az/ 2) + 0(Az-)|rfz' 

= G'«(Az/2)Az-f ^ [z'-Az/2]'|^|e'«-bO(Az-^) 

= e'“(Az/2)Az-t-0(Az^), ' (4-37) 


SO that the propagation equation becomes 

= (4-38) 

Even thoush equation 4.37 is a second-order approximation to the integral of the nonlinear operator, there is no guarantee that 
the actual implementation is second-order accurate, since self-consistent envelopes at Az/2 are unknown. The approximation 
u.sed in the split-step method is that these values are determined by a Az/2 linear propagation step, thus neglecting the nonlinear 
contribution. The result is that the nonlinear step is nominally first-order accurate, as shown in section 4.3. 

The exponential can be rewritten in the well-known symmetrized form [232] 

(4.39) 

which is second-order accurate in Az and requires three operations for one longitudinal propagation step. This separation into 
linear and inhomogeneous steps is the main characteristic of the split-step method. 

In order to verify the second-order accuracy of the splitting, the Taylor expansion of equation 4.39 is compared to that of 
equation 4.3S, First Taylor-expanding equation 4.38 re.sults in 

^ ^ q\- + qW ^ qLqL + 

+ Q"* (^) 0‘-+0'” (y) e'” (y)] +0(Az-^) • (4-40) 

The Taylor-expansion of the exponential operators in equation 4.39 to second-order is 

[l-H-^(2‘--^e^-(2‘'-bO(Az^) x 

2 o 

'l-b/Az(2'« (y) - (y) C'" (y) + 0(Ac-^)] X 

l+/yQ‘-^e''(2'+0 (Az’) 

= (I)] - f [e^6^+0'-e'“ (|) 

+ (2'» (y) Q‘-+ 0'” (y) (f)] + 

Since both of these expansions are the same to order Az”, the symmetrized representation 4.39 retains the second-order accuracy 
of equation 4.38. 

It is clear from the discussion of section 4,1 that the linear operator is most naturally applied in the Fourier domain, where 
time derivatives are converted into temporal frequency and space derivatives are converted into spatial frequencies. As a result, 
the linear propagation steps are evaluated in the Fourier domain as follows. 


(4.42) 
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where J"' is the inverse-Fourier transform operation, and is the Fourier phase function as given by equation 4.12 or 4.27 

for example. , . 

Now that the symmetrized representation is established, one further point needs to be addressed at this time. Propagation 

over some distance occurs in steps of size Az. From equation 4,39, propagation over 2Az is described by 

A,(2Az) = 

(4.43) 

which only requires five operations instead of six, because the two adjacent half-step linear operations coalesce into one full- 
step linear operation, as shown in Figure 4.1. In general then, for A/ steps of size Az, the total number of operations is 2/V-l-1, 
instead of 3N as suggested by equation 4.39. 

The following section will examine the special case of spatial propagation appropriate for (1+1)-D spatial solitons in slab 
wavesuides with the split-step method. The generalization to the (2+l)-D spatio-temporal method is considered in section 4.2.2. 


4.2.1 Nonlinear Spatial Propagation 


The (1 + 1)-D nonlinear evolution equation appropriate for the spatial soliton studies presented in Chapter 5 is given by equa¬ 
tion 3.169. repeated here for convenience: 


9Av . 9"Av 

ikoCXoA^^ + 


1 dU, 


+ 2ito-{[1 + il^ -h2 [A-H IKAk] \AyfA,] = 0, 


(4.44) 


where the non-paraxial nonlinear terms have been neglected due to weak nonlinearity, and the vector four-wave mixing term 
neeleeted under the assumption of short waveguide birefringence beat length. The transverse y dimension is not considered due 
lohnear confinement by a slab waveguide. For simplicity, this equation does not include the modal-averaged coefficients due 
to lateral confinement as derived in Appendix C. 

Using equation 4.34. the linear and inhomogeneous split-step operators can be written 


^ ~2kod.x^- 2 




^'0 


k~ = y ko j^/:o + ^0 • 

(2'» = kfnz {[1 -1- iK] |A.,|- + 2 [A + IKAk] Kv|'} . 


(4.45) 

(4.46) 


Here, the expressions within the square-root are the exact (assuming weak absorption) operator and Fourier phase represen¬ 
tations of linear spatial diffraction. A similar result was obtained previously [233], with a different, but equivalent, form of 
the linear non-paraxial operator. That work did not consider the nonlinear, non-paraxial terms that were derived in Chapter 3, 
but subsequent extensions 1234] did consider those terms for a purely linear, spatially-varying refractive index. As mentioned 
before, the nonlinearly-induced index for the present studies is small enough that these terms can be neglected. 

For correctness with off-axis plane wave components, the linear absorption term must be applied in the Fourier domain; 
however, these effects occur well beyond the order of the multiple-scales analysis in Chapter 3 due to the assumed order 
smallness of the coefficient a<). A constant (with respect to angle) absorption, which can be brought outside the square-root, can 
be applied in either the Fourier or real-space domain because of Parseval’s theorem. It is more convenient (and more correct in 
terms of the precise behavior in the attenuation of off-axis energy flow) in the numerics to apply this term in the Fourier domain, 
because, if applied in the real-space domain along with nonlinearity, there is an ambiguity whether to attenuate the envelope 
before or after calculating the nonlinear effect. Two-photon absorption must be applied in the real-space domain along with 
nonlinear refraction. 


4.2.2 Nonlinear Spatio-Temporal Propagation 

The (2-1-1 )-D spatio-temporal nonlinear evolution equation used in the simulations of Chapter 6 was derived in Chapter 3. Using 
the full Raman re.sponse in equation 3.170 results in: 


2i/.o + ik^)a^^A_, -I- — kr)l^o 


d-A, 

37- 



d^Ax .k'o d^Ax 
37 ^ ^kodTdx- 


(4.47) 
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+ + + [|/l.l' + 2 AdAvp]A, 


+ AikfUK 1 + ~c~ '2 ^ |Ajrt'Aj + 2A^r |Ay|* Aj:j 

+ [ 7 -I] 1=} 

+Ar rRR(r)\Ay(T-x)\-A,dx 
Jo 

+yRl”RR{x)A;{T-x)A:,{T-x)Aydx 


’«4 r,, 1 "5 


+ 2kl^ (^ + 2A;f [2 Aa- +YA-] ^ - [2A + -r] iA;,|- |Ay|-A., = 0. 

/2o [5 fu J 

The full Raman response is used in the numerics because the spectral bandwidth of the spatio-temporal waves used in the 
simulations exceeds the bandwidth of the Raman gain spectrum. 

The linear operator is written 


1 

^ 2 dr- 6 dr 2ki dTdx^ 24 dr ^ 2 


(4.48) 


« + ^ (“0 + + '■« (lOo + - ^' 0 - 

Since the linear e.xponcntial operator is applied in the Fourier domain, resulting in an exponential phase function, the linear 
propagation phase is written 

( 2 ^ = ^A-(A(i)+'a)o) [A(A 0 ) + C 0 ()) + ia(Aco + a)o)] - k^ - AtoAg - Aq, (4.49) 

which IS the exact linear pha.se k^ as discussed in section 4.1 with the inclusion of weak linear absorption, but only strictly valid 
here when A, < A(Ati)-fo)o) because of the addition of the nonlinear terms. 

The argument of the nonlinear exponential operator is defined 


AyNL: + A 7 NL 4 + /■ 


. 2 r»io A?,l 8NL: 


,10 c 2 j 8 r /' 


(4.50) 


and is applied in the real-space domain. The factor 1/A, is u-sed to handle those nonlinear terms which cannot otherwise be 
written in the form of equation 4.34, such as the last Raman term and the Kerr cross-phase modulation and all Raman terms 
in the first-order shock. This factor serves to remove any fast phase variation (due to nonlinear refraction) from the nonlinear 
terms, which are grouped as follows: 


NLa = tiK [ 1 + i—K [|A.,|- 4- 2Aa lAvpl Ay 

L "a: J ■ J 

+ r«RWlAy(7--T)|-A.,rfT 

4/io Jo 

+ ^ rRR[x)\A,[T-x)\-A,dx 
4/io Jo 

+ r/?/?(T)A:(r-T)Ay(r-T)Ayrfx, 
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I Ha I ft -) 

NL 4 =« 4 i l + 

1 /24 2 / 20/24 
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+ n^{l +y^ [2A/C + Ya'] ^ - \ [4A- " y"] } 1^,1^ A, 

+ n4 { ^ + 2Ak [2Aa: + Ya] — [2A + f ] | |A.x I" \A-y 1" Ax . 

Note that the factor 1 /Ax cancels out from some of the terms in NLt and all of the terms in NL 4 . 

The nonlinear “index” produced by the last Raman term is proportional to the factor 

( 4 . 53 ) 

\^x I 

Since all fast variations, such as the propagation phase, have been removed, and (and thus their difference) have at most 
order 8 " variation. The amplitude ratio serves as a normalizing factor to properly scale the size of the third Raman term in 
proportion to the first and second. 

The convolutional Raman response integrals of the intensity are evaluated using an equivalent Fourier identity, but it should 
be noted that the resultant Fourier domain is different than the conventional temporal Fourier domain of the fields since it is 
detined as the transform of the intensity. For example, 

I^RRix)\Ax{T-x)\-Axdx=!F-^{xRmT{\Ax\‘}}Ax, ( 4 . 54 ) 

where I i.-i the Raman susceptibility, approximated by a single Lorentzian resonance 


Xr(^) = 


Rq 

Qj-Qr-iOi' 


( 4 . 55 ) 


Siimulated Raman scattering is discussed in Appendix B. 


4.3 Implementation and Accuracy of the Split-Step Method 

The derivation of the split-step method indicated that the operator splitting given by equation 4.39 was second-order accurate 
in siep-M/e Ar. In addition, the midpoint method used to evaluate the longitudinal integral of the nonlinear operator, as given 
b\ equation 4 . 37 . is also second-order accurate, but relies on the knowledge of the field at the midpoint position, which is 
onl\ arproximatcly provided by the half-step of linear propagation. As a result, when the magnitude of the field changes 
sutiK ienilv rapid!) due to absorption, diffraction, etc., the accuracy of the entire method may scale with step-size worse than 
qu.idraiicalK The rea.son for this is that self-consistency is not strictly maintained since linear and nonlinear propagation are 
handled scparaiel). meaning that the nonlinear phase accumulated over the interval Ac may not be properly accounted for. 

.\ simple, intuitive explanation is provided by the consideration of the propagation of a spatial soliton using the split-step 
method .Starting with an initially transform-limited beam, the first half-step of linear diffraction results in broadening, or 
detocusing The subsequent full nonlinear step, based on a slightly erroneous field value, imparts a focusing quadratic phase 
ih.it over compensates the pha.se curvature accumulated during the half step. The next full linear step first results in focusing to 
a vv .list ( close to the original soliton form), and then broadening. This process is repeated over the entire propagation distance. 
It IS clear that the beam approximately returns to its original soliton form only at distances which are multiples of the step 
size, and defocuses and refocuses in between. The actual error arises from the fact that the nonlinear phase is applied based 
on an approximate value for the field determined by linear propagation. The result is that the nonlinear phase may be no more 
accurate than a simple, first-order, forward Euler scheme. 

The easiest way to numerically determine the order of accuracy of an algorithm is to compare the results of simulation with 
varying step size with that of “the best” simulation (i.e. using a very small step size). Another approach is to compare the results 
of simulation with an analytic solution, from which additional information can be gleaned, such as the absolute accuracy of 
the simulation, including the algorithm and model. This is a straightforward procedure when using solitons because the exact 
solution (in 1-D» is known and doesn’t change (except for the usually unimportant nonlinearly-inducedphase) with propagation. 
Both methods are used here. 

In the following sections, numerical accuracy will be determined for various situations for both (l-i-l)-D spatial and (2-1-1)- 
D spatio-temporal propagation. For each case, the normalized root-mean square, or RMS, error is calculated between the test 
simulation and the reference: _ 

^ " 1 

where the field amplitude 4'“' is calculated as a function of longitudinal step size. 


( 4 . 56 ) 
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4.3.1 (1+1)-D Spatial 

Figure 4.8 shows how the numerical accuracy scales with step size for a single soliton propagating without absorption. The 
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Ficurc 4.8: Plot 4 )f percentage RMS error as a function of step size for the propagation of a single spatial soliton without 
absorption using the split-step method. The data points represent results from the simulations while the lines represent error 
scalings ot‘ first and second order. 

smallest step-size plotted in the figure is Az = Zo/1024. and the "e-xact" solution to which the others are compared is the result 
obtained when Ar = Zo/2048. The longitudinal step size is written in terms of the confocal distance Z) becau.se the Raleigh 
ranee Z^) '2 is generally considered as the distance over which noticeable changes of the wave appear in linear propagation. 
Therefore, the linear longitudinal steps in the split-step method must occur over distances much shorter than Zo. 

The simulation is performed using the initial field profile sech(,v/uo). where uq = 17.3 ^m. resulting in a spatial full- 
width at hair-ma\imum (F\VHM) of 30.5 pm. meaning that the paraxial approximation is valid becau.se the fractional angular 
bandwidth K = 0.01. The propagation distance is 10 Z,. As show n in the figure, for large step sizes, the error decreases' linearly 
with decreasing step size. In addition, as step size decrca,ses pha.se this linear region, there is a large region over which the error 
.scales quadrati'cally. as naively expected. Finally, at the smallest step sizes, the scaling approaches linear again. The important 
point to note is that there is a large region over which second-order accuracy is obtained. This result thus verifies the order of 
the numerical method when stationary propagation is considered, but does not address the absolute accuracy when compared 
to the exact analytical solution or the order of the method for non-.stationary propagation, which may be different. 

The comparison between the numerical method with the theoretical soliton solution without absorption is shown in Fig¬ 
ure 4.9. This figure illustrates the same behavior as show n in I-igurc 4.8, except for the presence of a distinct error floor beyond 
which the simulation can be no more accurate. It therefore makes no sense to decrease the longitudinal step-size below that 
which reaches the error floor. 

The error floor is the result of discretization in the transverse dimension. Like the continuous NLS equation, the discretized 
NLS equation used in the numerics possesses a set of eigenmode solutions, which are different than their continuous coun¬ 
terparts. The numerical simulation is started with an eigenmode of the continuous NLS equation, which then evolves to an 
eigenmode of the discretized equation during propagation, leading to the absolute error shown. For the results of Figure 4.9, 
the 1024 element computational grid allows for 55 samples under the transverse spatial full-width at half-maximum (FWHM), 
and 6 samples under the spatial-frequency FW'HM, resulting in an error floor of 4 x as shown in the figure. Increas¬ 

ing the computational grid to 8192 elements and simultaneously decreasing the transverse sampling interval by a factor of 4. 




Final Report AFOSR F49620-95-1-0431, Spatial Soliton Interactions, S. Blair & K. Wagner, U. Colorado, Boulder 



0.001 0.010 0.100 1.000 

longitudinal step size (in Zg) 


Fiiiure 4.9: Plot of percentage RMS error between split-step simulation and the exact analytic form in the propagation of a 
single spatial soliton. The error decreases quadratically with step size, then reaches a floor at about 4 x The lines 

represent zeroth, first, and second order scalings. 

with 221 samples under the spatial FWHM and 12 samples under the spatial-frequency FWHM, decreases the error floor to 
2 ■ lO”"*' !. .As the number of transverse samples becomes larger, the eigenmode of the discretized equations approaches that 
of the continuous equation, resulting in a decrease of the error. It is also found that increasing the number of samples under the 
spatial frequencs FAVHM has little effect on the error. 

In Chapter simulations are performed in which the interaction between two spatial solitonsof different widths is studied. 
It IS therelore necessary to evaluate the error as the soliton width decreases, such that propagation no longer satisfies the paraxial 
approximation. Figure 4.10 plots error as a function of step-size fora pump soliton width = 2.16AJm, or FWHM 3.81 //m, 
with fractional angular bandwidth K = 0.1. In addition, the induced third-order nonlinear fractional index V: ~ 0.1, and the 
propagation distance is 10 Zy. where 2b is the confocal distance of the signal soliton of^width wq = 17.3 pm. Now, the step 
size Is \sriiien in terms of the confocal distance of the pump, or Z/,. where Zq = [wo/wp]"Zp = 64 Zp. Again, as shown in the 
figure, there is a region where the error decreases quadratically with step size, but now the error floor is significantly greater at 
0.0491. but IS still acceptable for most situations. The simulation is performed on a 2048 element grid with 14 samples under 
the spatial FWHM and 47 samples under the spatial frequency FWHM. Increasing the grid size to 8192 elements (with 55 and 
47 samples respectively), reduces the error floor to 0.029;. a significant decrease, but still two-orders of magnitude higher than 
the fully paraxial case. 

In this case, ihe transverse sampling is not the limiting factor. The main limitation is the fact that the simulation was 
performed with a soliton width that leads to non-paraxial propagation. Even though the linear non-paraxial corrections are 
included in the simulation, the scalings for K and v indicate that the nonlinear, non-paraxial terms are of about the same size 
as the linear non-paraxial correction, and should be included as well. As shown in section 2.3.1, the fully non-paraxial, 
but also non-vectorial, spatial soliton solution is the same as the paraxial soliton, except for an unimportant nonlinear phase 
factor, and is only valid when all relevant non-paraxial terms in the evolution equation are retained, which is not the case here 
for simplicity. 

These results indicate that the accuracy ofthe slit-step method scales second-order with longitudinal step size for stationary 
propagation. The accuracy must also be determined when attenuation due to linear and two-photon absorption is considered. 
In this ca.se, absorption causes broadening of the soliton, as discussed in Chapter 5. Linear absorption is applied in the Fourier 
domain alons with diffraction and is not expected to alter the order of the algorithm. Two-photon absorption, on the other hand. 
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Figure 4.10: Plot of percentage RMS error between split-step simulation and the exact analytic form in the propagation of a 
single spatial soliton in violation of the paraxial approximation. The error decreases quadratically with step size, then reaches 
a floor at about O.CWfT. The lines represent zeroth and second order scalings. 

is applied in the real-space domain along with nonlinear refraction. Since two-photon absorption alters the amplitude based 
upon the approximate field, it is possible that the accuracy of the split-step method scales less than quadratically. Aside from 
absorption, the parameters of these simulations are the same as those used in generating the data of Figure 4.10. 

Fisture 4.11 shows that the error in fact decreases quadratically until the floor is reached, the same scaling as before. The 
analyfic solution used as the reference is obtained in section 5.3, but is only approximate. The small error floor (some of which 
IS due to non-paraxial propagation before broadening occurs) verifies that the analytic approximation is a good one. For all 
single soliton simulations, at most 256 steps per Zp are sufficient to reach the error floor. 

The final simulations to consider are those of the spatial soliton dragging interaction. During interaction, the soliton shapes 
can change considerably, therefore it is not expected that the accuracy will scale second-order with step size. This is confirmed 
in Figure 4.12. which shows that the accuracy is approximately order ohe-half, although there are small regions over which 
the error scales linearly with step size. Absolute accuracy can no longer be obtained because no analytic solutions exist to the 
vectorial NLS equations for non-unity cross-phase modulation coefficient. The reference solution is obtained numerically with 
longitudinal step size Az = Zo/2048, as before. Because the simulation is paraxial, with wq = 17.3 pm for each soliton, only 
1024 transverse .samples are used, with 55 samples across the spatial FWHM. 

Figure 4.13 shows the accuracy when the pump soliton propagates in the non-paraxial regime. Here, Wp = 2.16/im, the 
transverse grid is of size 2048, and the propagation distance is 10 Zq. as before. Even though the overall accuracy scales 
approximately as order one-half, there is a region over which the scaling is order three-halves. There are similar regions in the 
paraxial simulation of Figure 4.13 as noted, but these region are less pronounced. 

The most important data obtained in the soliton interaction simulations of Chapters 5 and 6 is the contrast ratio of the logic 
gate under various input conditions. The contrast ratio is not strongly sensitive to small variations in the fields at the output of 
the gate, so it will depend weakly on the accuracy of the simulation. The contrast of a gate is generally considered acceptable 
when it exceeds a given threshold. By how much it exceeds a given threshold is not important, therefore, some inaccuracy 
in the calculation of the contrast can be tolerated. As a result, the spatial soliton interaction simulations of Chapter 5 use a 
longitudinal step-size of Ac = Zp/256, as mentioned in the single soliton results. 
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Fiuure 4.11: Plot of percentage RMS error between split-step simulation and the approximate analytic form in the propagation 
of a single spatial soliton with linear and two-photon absorption. The error decreases quadratically until it reaches a floor at 
about 0.02^:;. The lines represent zeroth and second order scalings. 

4.3.2 (2-f-l)-D Spatio-Temporal 

It IS expected that the accuracy of the split-step method for (2+1 )-D spatio-temporal propagation is similar to that for (1+1)-D 
spatial propagation. Because of the time-consuming nature of multi-dimensional simulations, this behavior is illustrated in this 
section using the results of two simulations. The grid size for both simulations is 1024 by 1024, with 20 samples across the 
spatial I'WHM and 14 samples across the temporal FWHM. Most of the computational grid is unused, but the size and sampling 
are .set to be consistent for the more general simulations of Chapter 6. The effect of this is to reduce the absolute deviation 
between simulation and exact results, but the order of accuracy will not be affected. 

Figure 4.14 plots error as a function of step-size for spatio-temporal propagation using the (2+1 )-D cubic-quintic NLS 
equation, which has stationary solitary-wave solutions, as discussed in section 2.5. For this simulation, m-q = 39.6 pm as 
before and the propagation distance is 2 Zq, reduced mainly because of the computational time required for multi-dimensional 
simulation. The results of the simulation are compared with the exact eigenmode which is computed numerically. As in the 
(1+1 »-D case, the error scales second-order w'ith step-size until an error floor is reached. Since comparison is made to an exact 
numerical solution, the relative error would increase for longer propagation distances, thus requiring smaller A; to reach the 
error lloor. 

The final numerical result, shown in Figure 4.15, indicates that the accuracy is slightly better than first order when the full 
numerical scheme discussed in section 4.2.2 is implemented for a scalar field. The error is computed using the numerical result 
obtained with the step size A; = Zo/2048 as the reference. Again, due to computation considerations, the propagation distance 
is 2 Zo- Even though the simulations of Chapter 6 use much longer distances, the short distance used here is not a limitation 
because the accuracy scaling is independent of propagation distance. As before in the (1 + 1)-D spatial case, for most purposes, 
256 steps per Zo is sufficient, although in some cases shorter step sizes are used in Chapter 6 because of the longer distances 
involved and the desire for quantitative information. 
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Figure 4.12: PIoi (»f percentage RMS error as a function of step size for the interaction between two (paraxial) spatial solitonsof 
the same size, including the effects of linear and two-photon absorption. The error decreases approximately as the square-root 
of step-size. The line represents one-half order scaling. 
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I•lyurc4 13 I’loi Hi percentage RMS error as a function of step size for the interaction between two spatial solitons of different 
widths, including the effects of linear and two-photon absorption. The error decreases approximately as the square-root of 
step-size The lines represents three-halves and one-half order scalings. 
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I'liiurc 4.14: ol percentage RMS error between the split>step simulation and the exact numerical solution for stable spatio- 

temporal soliiarv wave propagation. Only the effects ot paraxial diffraction, group-delay dispersion, and third- and fifth- 
order nonlinear refraction are included. The error decreases approximately second-order with step-size until a floor at about 
9 > 10”^^; IS reached. The lines represent zero and second order scalings. 
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I-igurc 4.15 ol percentage RMS error as a function of step size for stabilized spatio-temporal solitary wave propagation, 
including all relevant higher-order effects. The error decreases slightly better than first-order with step-size. The line represents 
tirst (>rder scahn^. 





Chapter 5 

Conclusions 


This thesis has developed the necessary framework for the study of general nonlinear, vectorial, spatio-temporal phenomena 
with larce spatial and temporal bandwidths. This framework was applied to the study of novel optical logic devices based 
on the spatial interaction between 1-D spatial solitons and 2-D spatio-temporal solitary waves. These logic gates were found 
to have the properties of full level restoration, fanout with large noise margin, and cascadability to implement arbitrary logic 
functionality. 

Chapter 1 provided motivation for the study of all-optical switching and logic devices, and covered the basic requirements 
for these devices as well. The main differences between switching and logic devices is that an optical logic device regenerates 
degraded data (given sufficient noise margins), has logic-level restoration, provides fanout, and is cascadable. Many contem¬ 
porary switching devices, which do not intrinsically possess these properties, were then discussed: the nonlinear directional 
coupler, the Kerr gate, the nonlinear optical loop mirror, and the terahertz optical asymmetric demultiplexor. It was then shown 
that the intrinsic limitations of these devices could be overcome through the use of optical solitons, which propagate without 
dispersing and/or diffracting. In particular, the temporal and spatial soliton dragging interactions possess the necessary require¬ 
ments of a three-terminal logic gate, and form the basis for the cla.ss of angular deflection logic gate studies of the later chapters, 
for which a nonlinear phase shift less than n can produce a resolvable change in the output state of the device. 

A detailed di.scussion of optical solitons was given in Chapter 2. Following the historical development of solitary wave 
and soliton phenomena. 1-D spatial. 1-D temporal, and 2-D spatio-temporal solitons were discussed. In the 1-D cases, soliton 
.solutions were obtained for higher-order equations beyond the traditional (l-i-l)-D nonlinear Schrbdinger equation. For the 
spatial soliton. this corre.sponds to the scalar non-paraxial solution, while for the temporal soliton, solutions were shown for 
equations that contained higher-order dispersive terms one or two orders beyond the slowly-varying envelope approximation 
(hut without Raman scattering). In the temporal soliton section, the effects of third- and fourth-order dispersion, optical shock, 
and Raman scattering were discussed as well, which have ramifications for the spatio-temporal simulations of Chapter 6. 
Finally, in the di.scussion on 2-D spatio-temporal solitary waves, fundamental eigenmode solutions to the (2+l)-D cubic-quintic 
nonlinear Schriidmger equation were presented, and the stability of these eigenmodes was shown due to the ultrafast saturation 
effect of a negative quintic nonlinearity. The effects of higher-order perturbations on these eigenmodes were discussed in 
Chapter 6. 

The derivation of the fundamental nonlinear wave equations was presented in Chapter 3. First, the integral representation 
of the material polarization expansion up to fifth order was transformed into compact differential operator forms suitable for 
the quasi-monochromatic wave representation. Then, the fully vectorial, nonlinear Helmholtz equation was derived for optical 
wavepackets centered about a single fundamental frequency. Because of the difficulties this second-order equation presents to 
analytical and numerical solution, the following section derived via the multiple-scales perturbative technique, a fully vectorial, 
first-order nonlinear wave equation that consistently includes terms two-orders beyond the slowly-varying amplitude, slowly- 
varying envelope, and paraxial approximations, in addition to terms that describe the vectorial nonlinear coupling with the weak 
longitudinally-projected field and nonlinear coupling with a weak third-harmonic wave which produces an effective saturating 
quintic nonlinearity. This was the fundamental theoretical result of the thesis, and has application not only to the numerical 
studies of later chapters, but to other areas of study as well. For example, considerable attention has been paid recently to (3 -h 1)- 
D nonlinear spatio-temporal propagation in the context of pulse splitting in the normal dispersion regime [152,153]. Simplified 
NLS-type models [ 154) have predicted pulse splitting, but it was also recognized that higher-order terms needed to be included 
in order to follow the subsequent evolution of the split pulses, which have significant angular and spectral bandwidths. More 
recent work has included scalar space-time non-paraxiality with shock [155], and NLS modified with Raman scattering [153]. 
These studies are able to qualitatively reproduce the asymmetric splitting behavior observed experimentally [153], but neglect 
additional physical effects that were derived in this chapter that can also lead to asymmetry. 
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Chapter 4 discussed linear spatio-temporal diffraction and the split-step numerical method used for the simulations in the 
remaining chapters. Diffraction in linear homogeneous media can be performed exactly given the correct initial conditions. 
The diffraction process can be viewed intuitively using the momentum space representation, which is generalized to include 
spatio-temporal diffraction in dispersive media. The split-step numerical method treats inhomogeneous propagation in two 
separate steps. The first step is linear homogeneous propagation over a distance short compared to the diffraction length 
and inhomogeneous length. The second step treats the refractive effect of inhomogeneity due to changes in the linear and/or 
nonlinear properties of the medium over the same distance. Using the symmetrized representation, the split-step method is 
expected to be second-order accurate in step-size. However, in the presence of strong nonlinear effects, the accuracy can 
drop to first-order or less. Even though the scaling can be worse than desirable, comparison of the numerical results to exact 
analytical and numerical solutions show that sufficient absolute accuracy can be obtained in the simulations. 

The heart of the soliton interaction studies for optical logic gates is Chapter 5, which considered logic gates based on spatial 
soliton interactions. Specifically, the spatial collision and dragging interaction between orthogonally-polarized spatial solitons 
were found to provide the best performance for logic gates. These interaction were then studied in detail using the threshold 
contrast metric to find optimal operating parameters, where the dragging interaction generally produced better results. Then, the 
effects of linear and two-photon absorption on the propagation of a single soliton were studied. Figures-of-merit were derived 
to evaluate the suitability of a particular nonlinear material for soliton logic applications. The spatial collision and dragging 
interactions were then evaluated in the presence of absorption with the conclusion that, due to shorter effective interaction 
distances, the dragging interaction again provided better performance. Finally, using the metrics of small-signal gain, large- 
signal gain, fanout, and noise margin common in electronics, logic gates based on these interactions were examined. Subsequent 
cascaded studies show that a sequence of controlled inverters, in which the pump output of one stage divided by the fanout factor 
.serves as the signal input to the next stage, which in the asymptotic limit forms a stable ring oscillator, results in complete logic 
le\cl restoration and fanouts of two or greater with large noise margin. An additional cascaded geometry, in which the pump 
output of one stage serves as the pump input to a second stage, implements a two-input NOR gate. This gate was shown to 
posse.ss complete logic-level restoration as well, with fanouts of two or greater and logic levels compatible with the single-stage 
inverter. These studies form the second major, systems-level, contribution of this thesis, and are perhaps the first time that such 
studies have been undertaken for all-optical devices. 

The final results chapter. Chapter 6, studied logic gates based on 2-D spatio-temporal solitary waves. Stabilized propaga¬ 
tion against the higher-order effects of third- and fourth-order dispersion, space-time focusing, optical shock, and intra-pulse 
stimulated Raman scattering, was demonstrated due to quintjc index saturation. However, it was found that downshifting due 
to Raman scattering was the most detrimental effect to asymptotically stable propagation. Using this stabilized spatio-temporal 
wave as the pump, logic gates based on the spatial dragging interaction were studied. The single-stage inverter was shown to 
provide large-signal gains of two or greater, but cascaded operation proved more difficult due to the strong Raman downshift of 
the pump which serves as signal inputs to subsequent gates. After equalizing the group velocities of the interacting waves in the 
cascaded stages, the cross-Raman downshift caused strong energy depletion of the pump, which resulted in spatial broadening 
and dispersive wave generation, and reduced contrast. Using a weaker proportion of Raman to Kerr nonlinearity (more appro¬ 
priate for a material such as AlGaAs), fanouts of two or greater were obtained in cascaded operation of inverter and two-input 
NOR gates with complete logic-level restoration. 

The studies of this thesis pave the way for the experimental implementation of low-energy, ultrafast, all-optical logic gates 
for a varietv of applications such as communications switching, routing, and coding, and special-purpose digital computing. 
The results show that, even though materials i.ssues pose a great challenge for ultrafast systems with low average power, 
experimental studies using existing materials are warranted and could lead to new capability not possible with current or future 
electronics. 
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Appendix A 

Symmetry Properties of the Nonlinear 
Susceptibility Tensors 


The symmetry properties derived in this appendix are the intrinsic index/frequency permutation symmetry and spatial symmetry. 
The intrinsic symmetry is a fundamental property of the susceptibility tensors, while the spatial symmetry depends on the 
particular choice of material class. The two triclinic classes are the only ones in which the number of independent elements 
cannot be reduced. This thesis deals mainly with the isotropic symmetry class (i.e. glasses, liquids, gases), but the results are 
valid for cubic 432 (i.e. LiFesOs), 43m (GaAs. CdTe) and m3m (Ge, Si, GaP, ZnSe, ZnTe) as well with minor modifications. 

First, the intrinsic index/frequency permutation symmetry property is discussed in the general case of -order nonlinear 
polarization. Then, the spatial symmetries are discussed for the specific cases of the third-order and fifth-order polarizations. 

A.l The Intrinsic Permutation Symmetry 

In order to present a general proof of the intrinsic permutation symmetry property, the fully dispersive /i'''-order nonlinear 
C()n\x>lulion integral is used: 

p['‘\t] = Zof [ ■-.[ a„(Tl.T2.T„)£a,(?-Tl) (A.l) 

^ , Jo Jo Jo 

Fu, ('- T;)...£cx„ (t - T„)r/Ti r/T2 • • • . 

The spatial dependence is dropped for convenience. The form of equation A. 1 is a result of the application of time-invariance [220], 
which, when combined with the causal nature of the response function, leads to the intrinsic permutation symmetry property, 

as discussed later in this section. 

Followini: the same steps used for transforming the linear response integral in section 3.1.1, the nonlinear polarization A.l 
IS written in the temporal frequency domain as 

=£o|...|xyu, - 

Fcx„(o)„)5(a)-Q)| - ...-(U,i)<fMi 


where the /;''' order nonlinear susceptibility tensor is defined 

Xia, ..a„ (“’^1 .= /•■•/- 

Here, 0 is the Heaviside step function that explicitly enforces causality. Note that all factors of 1/271, due to the definition of 
the Fourier transform pair (given by equations 3.20a and 3.20b) in terms of the radial frequency (o, are neglected for simplicity. 
These factors cancel upon return to the time-domain representation. 

In the non-degenerate frequency case of equation A.l, the n fields in the product can be arranged in any order with the 
corresponding rearrangement of the indices and frequency arguments of the susceptibility tensor. There are n\ such equivalent 
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arrangements 


x'/aja, .a„(0);COi ,C0:.CO„)£a| (®i)£a2(W2) • • .£a„(tO„) - 

Xy^a,...a, W:, • • - -WOEa, (W„)£a,(C02) • • -Ea, (Wj) = 


X;a„a„.,...a, 

The field products in equation A.3 can appear in any order by the commutation of the multiplication operator (there are nl 
equivalent arrangements of the fields) and results in the intrinsic index-frequency permutation symmetry of the x tensors: 

For x' ". the intrinsic permutation symmetry property states that 

Xju„,(a;o)i.o}2.(nj} = X;w(®;“i.®3,“2) = 

Xj;*„,(a);(0:.0)|,a)3) = X37m*(w;“2>“3-“i) = 

Xm;(“;“3-W|.Ci) 2) = X;m;i(W;®3,W2> Wi), 

lor example. 

I-ull index-frequency permutation symmetry, or Kleinman symmetry, results when the n"'-order interaction is non-resonant, 
meaning that x'"' is real [215]. In the case of general phenomena for example, Kleinman symmetry requires that all 
one-. uu>-. and' three-photon transitions are virtual (i.e. the medium is transparent to all frequencies involved), but in the case 
oi nonlinear refraction, the requirement applies to just the one- and two-photon transitions. It is apparent that the Manley- 
Roue relaiions hold under these conditions. Kleinman symmetry means that the first index/frequency pair takes part in the 
permutation, with a total of (/) -f 1)! arrangements. In addition, under Kleinman symmetry, the (/i -F 1) frequency arguments can 
be permuted independently from the indices [220]. For generality and to include the effects of one- and two-photon absorption, 
Kleinman svmmetry is not invoked for the third-order susceptibility in this appendix, although it is invoked for the fifth-order 
suNeepiibilii\ in order to simplify the derivation, without loss of validity, as explained later. 

A.2 Symmetries of 

In The ease »>! the third-order polarization, equation A.2 reduces to 

(O)) =z So J j J (A.6) 

5(0)-0)1 -0)2- 0)t)^/o)|^/0)2^0)3. 

The X ‘ tensor has SI elements. The intrinsic permutation symmetry property is used in combination with the spatial symmetry 
reduel ions to reduce the number of independent and nonzero elements for a given nonlinear process. The reductions due to 

spallal symmeir\ are considered ne.xi. _ 

For the more general case of the cubic crystal symmetry classes 432, 43m or m3m, there are 21 nonzero elements of the 

X ' tensor of which only 4 are independent [2201: 


(3 1 i3- >'3) 

X-t-tVV — XnET — Xv\l 




J.T _JT. 

>Crv\.\ — /^\zc\ — A.'v-v-yv — A.yc2y 


.yO) -yO) 

— A^yy 


yd) _y(3) _yO) 
JL\z\z “ /Lzxz,x — Azyzy 


.yO) 

' Kzxxz ^ JLzyyz 


where y A. For the isotropic symmetry class, only three elements are independent because of the additional requirement [220] 

Aside from using the spatial symmetries, equation A.6 cannot be further simplified until the nonlinear mixing process is 
specified. In this thesis, the third-order processes of interest are the frequency-degenerate case of nonlinear refraction, third- 
harmonic generation and third-harmonic downconversion. The latter two processes, in cascade, result in effective fifth-order 
nonlinear refraction at the fundamental frequency. These three processes are now discussed separately. 
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A.2.1 Nonlinear Refraction 

Using the quasi-monochromatic approximation 3.21. equation A.6 can be written for nonlinear refraction 

= j J J J Xyt;„,(co;(0i,co:,co3) X (A.9) 

^{c0|-f o)o)A/(co 2 tno)Ani{o)3 tt)o) 

+A*.((1)1 -a)o)A; (a)2 + Wo)'4m(C03-(Oo) 

+;4*(a)| -a)o)A;(co2-(Oo)A*,(co3 + a)o)] x 
5(C0 — tOi — 0)2 — C03)^C0i^C02^i(03, 

with corresponding equation for the conjugate third-order nonlinear polarization with time dependence centered about -coq. 
Now. the third-order susceptibility tensor is Taylor-expanded about the center frequencies COq and -(Dq. Taking the first term in 
equation A.9 for example 

Xjx/m(w;a)i,0)2,(03)/4;.(a)i -ha)o)A/(co2 ~(Oo)A,„((l)3 — odo) = 

. , , 3X7r/ni(“o;(«i.Wo.“o) 

Xyit/m(“o;-“0,“0.tt»o) + tWl+tOo] 5“ +’" 

(Oi^-tOn 


(Oi^-tOo 




-1- It!) '. - OJo] 

/ij((Ui -+-a)o)/4;((02-(0())A„,(ti)3 -(Do). 


3 x; -(Op. top, ( 1 ) 3 ) 


It l^ evident that each term in the susceptibility expansion is subject to the spatial symmetry reductions, but the fact that the 
derivative terms are subject to the intrinsic index-frequency permutation symmetry is less obvious. In order to see this, the 
pniper Tayli>r e.xpansions are used in equation A.9 which is subsequently transformed back into the time domain, using a 
pr!>cedure similar to that outlined in section 3.1.1. The result is 

i/l = — [y — tUii -f I.(1)11 -f i^.cOd-t- l/f/fr)/!«,(/) (A. 10 ) 


+ Xjkh'i + i^. -ii\< -r I ^ AOt) -t -1 AkirjAj {r )A„,(i ) 

+ Xjkin,+ i^. 0 M,-r 1 ^ i^^Ati!)A,ir}A;jr} e 


where the tiperator arguments are interpreted as the Taylor c.xpansion about the frequency variable and the derivative operations 
arc only perti>rmed on the corresponding field envelopes; thus, ordering ot the field envelopes is important. After the derivative 
operators have been applied to the appropriate envelopes. thet>rderingof the envelopes is no longer important, and the arguments 
of the susceptibiiitv {with the corresponding indices) can be permuted. It is now evident that index-frequency permutation 
symmetry holds. 

The combination of the spatial symmetry reductions with the intrinsic permutation symmetry property is now used to reduce 
the general, fully-dispersive case of nonlinear refraction and the special case of the Raman susceptibility. 

Fully Dispersive Nonlinear Refraction 

Applying first the spatial symmetry reductions of the more general cubic classes to equation A. 10 results in the following 
tabulation of terms 

n Xjjj] 

f 3 3 ^ N 
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2) x^7ii^coo;-coo + i|-,coo + /^,coo + /^j-4}(r)4t(r)AA(0 

3) X;VU-^“o;“o + /^,-(Oo + '^.‘>>o + '^^A;(OAi(/)At(r) 

3) Xy;a-^coo;a)o + <^-Wo + '^.-f^ + '^^A;(r)A/;(f)A;,(/) 

3) Xjkjk(^0’-^o + i^^.tJio + i-^,0^4-i-^'^Al(t)Aj{t)Ai;(t) 

2) Xj/t;(:(^®o;wo + '^.-o^ + /^,a)o + /^^A/,(r)A*(/)At(0 

3) Xjk jk ^(^0; COo + / . 0)0 + / gj, -COo + / ^ ^ A* (/ )Ay (f) A^. (/) 

3) XyW; ^“o;-o)o + '^'0)o + '^-0)o + /^^ A*(r)At(r)Ay(f) 

3) X>«;(^“o;o)o + i^.-o)o + /^,o)o + J^^Ai.(OAi.(/)A;(r) 

2) Xjkkj ^a)o;0)o + /■ |-.0)^, + /1-, -0)0 + i|:) A*(/)Ax.(/)A}(f). 

The same numbered terms are identical by the use of the intrinsic index-frequency permutation symmetry property, thus reduc- 
in^ cquaiii)n A. 10 to 

(A.ll) 

+ ^Xjkkj ^0)()-. -0)0 -I- / ^. 0)0 + / ^. 0)0 + a;, (t )Ai. (r )Aj(t) 

+ Xy;u(o)t.;-0)o + /|:.o)., + *|.o)o + «|)A-(/)4*^ 

where j = k. It is evident that equation A.l 1 is also valid for instantaneous nonlinear refraction by simply removing the time 

derivatives from the arguments of the susceptibilities. 

For the isotropic symmetry class, the following relationship holds 

Xj;Vy(o)o;-o)o.o)o.o)o) = 2x,uy(o)o;-o)o.o)o.o)o) 

+ Xy;u(0)o'.-0)o-0)o.0)o), 

and. under Kleinman symmetry. Xya 7 (o)o;-o)(). 0 )o.o)o) = x„u(o)o;-0)o.0)o.o)o). These relationships are valid in both the 
fully dispersive and instantaneous cases. The instantaneous case is shown for brevity. 

The Raman Susceptibility 

r 

Now. considering the special case of stimulated Raman scattering as discussed in Appendix B, equation A.IO reduces to 

+X;*/m [ At(t)Aj(f)j A„,(r)| e . 

Now. the derivative operator argument operates on the product between two envelopes enclosed in brackets. Notice that since 
the Raman su.sceptibility is a function of one frequency argument, the intrinsic permutation symmetry only applies to the two 
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envelopes inside the brackets. Applying the spatial symmetries results in 

1) ^JJjj 

2 ) [A}(r)A*(r)]A*(r) 

3) [Al.(r)Aj(t)]A*(r) 

2 ) [A*(/)A}(t)]A*(r) 

4) X%kj{^ou^^ [4i(tK.(f)]A;(r). 

Again, the same numbered terms are identical by the intrinsic index-frequency permutation symmetry. With these reductions, 
equation A. 12 can be written 

^r(0 = j{xL7(‘«o;<|)h(OA,(0]A,(r) (A.13) 

+X;a 'j ^ j ['4* (' )Ak (0 ] Aj (/) 

['4l(04;(t) ] A*(/) 

This expression is the generalization to higher-order terms of the result obtained for the study of the Raman effect in AlGaAs 
waveguides 115 11. Note that, for the isotropic class 

xL7=2X;;u+ %;?«;• (A-'4) 

and. invoking Klcmman symmetry, X^jkk — X^kj- 

A.2.2 Third-Harmonic Generation 

The general form of the instantaneous polarization at the third harmonic driven by the fundamental at coq is 

4hc 7(0 = |x7t/™(3‘*Jo;o)o.O),i.a)o)A*(/)A,(f)A„,(/)e-3^^^^ (A.15) 

where the frequency dependence is neglected for simplicity. This is not an approximation as such because in the multiple-scales 
analysis of Chapter 3. the frequency dependence of the third-harmonic susceptibility plays no role to the order of the derivation. 
Using the spatial symmetry properties, the terms in the third-harmonic polarization are 

1) X777y(3t«o;Wo,t»o,«o)Aj(/)A;(/M;(r) 

2) X77«(3a)o;coo,Wo,tOo)Ay(r)A*(/)A,t(0 

2 ) Xy*7t(3(Oo;tOo,(Oo,(Oo)AA.(/)A;(r)/l;(f) 

2 ) Xjkkj{^oX)\(iio,(i>o,(iio)Ak{t)Ak(t)Aj(t). 
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The intrinsic permutation symmetry property forces the last three terms to be equal. The polarization is then 

= f [x;7j7(3wo;coo,(Oo,o)o)A,(rM^^ (A.16) 

+ 3xyuv(3too;(Uo,(Oo>(<^o)^/.(0'4t:(^M;(^) ] ^ 

where, for the isotropic symmetry class, 

Xjjjj{3o)Q;(no,U)o-o^o) = 3Zjiikji3(Oo;(i>Q,(tio,o^}- (A.17) 

Note that these expressions are generally valid even under the conditions of three-photon resonance, neglecting frequency 
dispersion. 


A.2.3 Third-Harmonic Downconversion 


Again neglecting frequency dependence, the form for the polarization at the fundamental due to mixing between the third- 
harmonic and the fundamental is 


p(3)+ 

^Tf/DJ 


,(0 = 


£o 


[ XjJiii,i(<^o- 3t0o. -coo. -Wo) Sjt(t)A*i (t)A*,(t) 

(Wo; -Wo- 3(Oo, -tOo)^it )^ii* )KM 
+Xjkim (Wo; — too. — Wo, 3a)o) (r)A; (r)S„,(r) j 


g-itoo' 


(A.18) 


w here S is the envelope of the third-harmonic. Using the spatial symmetries, the terms in the polarization are 

1) X;7yy ( Wo ; 3(00. - Wo , - Wo ) 5; (r) A} (r) A} (/) 

1) Xy777(wo;-Wo.3tOo,-(Oo)A}(r)5;(/)A*(r) 

1) X;777(Wo;-Wo.-Wo,3(Oo)A}(r)A}(/)S;(/) 

2) Xy 7 «(wo; 3(0(1. -tOo. -(Oo) Sj(t )Al{t )A^(/) 

3) Xj ( Wo: - Wo. 3(Oo. -tOo) A}(f )Sk(t )Al (/) 

3) Xy7«(Wo;-Wo.-tOo.3a)o)A*(r)Aj.(/)5it(/) 

3) Xjkjk (Wo; 3too. - too. -too) Si (?) A} (r (r) 

2) Xytytlwo;—Wo.3too. —lOo) A]J(r)Sy(/)Ajr,(/) 

3) X;i(7it(wo;—Wo--Wo.3tOo)Aj(/)Ay(?)Si(/) 

3) XyH7(“o;-3wo. -Wo. -tOo)Si(OAi(/)A*(/) 

3) X;Uy(tOo;-tOo.3tOo.-tOo)A;.(t)Si(r)/4}(r) 

2) X7«;(Wo;-tOo-(Oo.3tOc))A2(f)A;j(r)Sy(/), 

w hich IS in the .same form as nonlinear refraction. This is expected because in both cases, there are only two non-degenerate 
frequencies. 

The nonlinear polarization for third-harmonic downconversion is 


^ [x; 777 (W(.; 3 tOo.-(Oo,-too)Sy(r)A}(r)A*(f) 
+ 2 xy«y (Wo;- 3 too, - too,-too) Si (f )Ai (r) A* (r) 
+ XyAt(wo;3too.-(Oo.-too)Sy(t)A]J(r)A*(r) 



(A.19) 


For the isotropic symmetry class, 

Xy 777 '(Wo; 3 ( 0 o. -too. -tOo) = 2 xyiiy(wo; 3 (Oo, -tOo, -tOo) 
+ Xy 7 u(Wo; 3 tOo,-too,-too), 


as before. 
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A.3 Symmetries of - Quintic Nonlinear Refraction and Three-Photon Absorp- 


In the case of the fifth-order polarization, equation A.2 reduces to 

:=EoJ-J )^/((^^2)^m(C03)£n((04) X (A.20) 

£o(a)5)8(CO-(Oi --C02~0)3 --(04--C05)flfa)i^/C02^/(03^(04^C05. 

The tensor has 729 elements. Even though the development of the third-order polarization is valid for both the isotropic 
and cubic 432, 43m and m3m symmetry classes and processes other than nonlinear refraction, the fifth-order polarization is 
developed only for isotropic symmetry and instantaneous response. This is a simplification that has no consequences in the 
multiple-scales analysis. Any frequency dependence will appear one order later than the instantaneous response. In addition, 
fifth-harmonic generation is also not described because the influence of the fifth-harmonic on the fundamental occurs well 

beyond the order of the multiple-scales derivation. 

In the lime-domain then, the fifth-order instantaneous polarization for nonlinear refraction is written 

(COq; “-COq. *“(00. COq. (Oq, Wo)A][ [t)A* (t)Ani{t)A,i{t)Ao{t) 

+ Xjklninoi^Qy^^O^^O^ 

+ Xjkim,w{(i^o ; -^0 .0)0 ^ Wo • -0)0. Wo) A;, (f )A/ (f )A,„ [t )A*^(t)Ao {t) 

+ Xjkimno (Wo; “ Wq , Wo , Wq . Wo . - Wq) (t ) A/ {t )A„i {t ) A„ (r) A* (f) 

+ Xjkhmni^O'^^O^ “Wq. -Wq. Wq. Wo)A^.(r)A/(/)A*j(r)A„(f)A,,(r) 

+ XjT7m«r;(wo; Wq. —Wo.o)o. —(Oo. Wo)Ajt(r)A/ (/)A,„(f )A,j(r)A^,(r) 

+ XyT 7 mm/(Wo» Wq. —Wq, Wq. Wo. —Wo)A)t(r)A^ (f)A„i{t)Afi(t)A^j(t) 

+ X7t7mm;(wo; Wo, Wo.-W q.- 0 ) 0 . Wo)A^.(/)A/(r)A*,(/)A*(r)A,;(r) 

+ X7it/m«r>(wo»Wo. Wo. —Wo. Wo. “0)o)Aj(;(/)A/(r)A„,(r)A;,(r)A^/r) 

+ X;T7»wjr^(o^o;wo.o)o. Wo. -Wo. -o)o)Ajt(0A/(/)A„,(r)A*(r)A*(0 ] ^ 

Spatial symmetry of the isotropic class results in 183 nonzero terms of the sixth-order tensor, with 30 independent: 


K-jjjjjj K-jjjjtt^ Xjjkktk Xjjkkj] Xjjjkjk Xjjkjkj Xjjjkkj 

Xjjkjjk XjkjkjJ Xjkjkkk Xjkjjkj Xjkjjjk Xjkkjkk Xjkkjjj 

Xjkkkjk Xjkkkkj Xjjkkii Xjjkiki Xjjkiik Xjkjkii Xjkjikt 

Xjkjiik x.jkkjii y.jkijki y^ikhik Xjkkiji Xjkikji Xjkujk 

Xjkktij Xjkllkj 

where I. Using Kleinman symmetry (which is not strictly valid near three-photon resonance), the fifth-order polarization 

responsible for nonlinear refraction and three-photon absorption can be written 

^ [xjjjjjji^y- -Wu -(Ai-Wo.Wq.oJo) [lO lAjit)^Aj(t) 

+5 Xyyt (“O; - Wo ■ - too -• Wq , tOo) X 

[4 (f )/\* (/) -f 6 |i4jt(f) |‘’i4j(f )j 

-I-10Xj;y7u(wo:-Wo.-0)0.Wo,a)o,(Oo) X 

[3/f*-(r)At-(r) -l-Aj{r)A;-(r) +6l>i7(f)|- |A;t(/)|-] Aj(t) 

+ 15Xj 7U//(Wo.-Wo,-(00,0)0,0)0,(00) X 

[ 4 |A,(r)|Mr(r)A‘(r) + 2 Af(r)A 7 (f)Ay(r)-f 

4lAi.(r)|-|A,(f)|-A7(r)j |, 





Final Report AFOSR F49620-95-1-0431, Spatial Soli ton Interactions, S. Blair & K. Wagner, U. Colorado, Boulder 


138 


where XnmJt = Xjjjjkk- I" this case, Kleinman symmetry means that there are no one-, two-, three-, four-, or five-photon 
resonances. As mentioned previously, these processes (except for the possibility of three-photon absorption) are assumed weak 
at the very least for materials of interest. 

Now, using the relationship [235] 

%Jjjjjj ” ^ Xjjkkkk =15 Xjjkkll t 

the fifth-order polarization is finally written 

= ^X; 7 ; 7 ; 7 ((Oo;-coo.~coo,coo,a)o,w^^ x (A.23) 

\Aj{t )\^+ - \Aj(t)\“ \Ak{t)\^Aj{t) 

+l |4t(/)|-Ar(0^}(0 + ^A}-(/)A,-(r)A;(r) -h+(r)A;(f) 

+ ^ lA*(01'A/'(t)A'(0 + ^Aj‘(f)Af(r)A;(r) -h - lA<,.(f)|'|A;(r)|"Ay(r) . 

A similiir result has been obtained previously [226]. The first three and last terms are phase-independent and give rise to 
tilih-i>rder self- and cross-nonlinear refraction. The second line contains phase-dependent couplings between two polarizations. 
These terms are rigorously absent for the orthogonal circular polarization basis set [226], and are neglected here under the 
asNumpiion of differing phase velocities between the orthogonally-polarized linear eigenmodes of a slab waveguide. The last 
three terms contain all three field components and can be neglected. This is justified in the multiple-scales analysis where it 
Is shin^n that, in the paraxial approximation, the projection of the field onto the direction of propagation is one order smaller 
ih.m the transverse projections and therefore, in the fifth-order polarization, all three components do not enter to the order of 
the dcrnalion 






Appendix B 

The Raman Response Function and Raman 
Susceptibility 


This appendix examines the third-order nonlinear response given by 

pf\r,t) = toR%,Jr)Et{r.t)Ei(r,t)E,„(r,t) (B-1) 

-l-Eoj^ /?%„(r.T)£t(r.r-T)£;(r,r-T)£,„(r,rMT, 

which consists of instantaneous (Kerr) and delayed (Raman) parts. Specifically, the second term, or the Raman response, is 
examined in detail here. The spatial dependence will be dropped in what follows. 

The first section presents a classical derivation of the Raman susceptibility based upon a simple spring-mass model for 
molecular vibration. The second section derives the Raman gain coefficient which is the quantity measured experimentally. 
The final section takes the third-order polarization given by equation B.l and approximates the convolution integral for finite- 
bandsudih excitation. The resulting form is more suitable for analytic representation. 

B.l Origin of The Raman Response 

The Raman response typically arises from the motion of a molecule in response to a driving electromagnetic held. This is in 
contrast to the electronic response which involves oscillation ot the electron cloud relative to the nucleus, producing a radiating 
dipole. A molecule can undergo rotational or vibrational motion, but since this thesis is mainly concerned with nonlinear media 
in the solid-state, it is a.ssumed that the vibrational motion dominates. 

In order to derive the Raman response function. a.ssume that there are N identical harmonic oscillators per unit volume, 
where each oscillator represents one molecule. For sirnplicity. only the 1-D scalar case is considered so that the vibrational 
coordinate is represented by X(/). Newton's second law [2.16| states that 

(B.2) 

(If- 

w here ni is the mass and. in the case of the harmonic oscillator tor simple spring-mass system), represents the total force 

on the molecule and includes the force of the driving field and the natural restoring force via Hook's Law. 

The electrical driving force on the molecule can be derived by first considering the electrostatic stored energy density [237] 

•£= [£(/)• £(f)], (B.3) 

where E is the total material dielectric constant. For simplicity, keeping only the contribution from molecular vibration, the 
dielectric constant can be written 

e = E{)[1-l-A(a(X)], (B.4) 

where Eo is the free-space dielectric constant, and a represents the molecular polarizability (not the linear absorption coefficient 
which is not considered in this Appendix), which is necessarily a function of the normal coordinate of vibration X, i.e. the 
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driving force is a function of the offset from the rest position. Taking a two-term Taylor expansion of the polarizability results 

;)ry 

a(X) Oo-l-X ^ , (B-5) 

x=o 

where Oq is the polarizability at the rest position and the first derivative is the differential polarizability. 

Now, the electrostatic stored energy can be written 


Efl f r , 
£=-|l + /V^ao + X^^^^ 


[£(0 •£(/)]. 


The force exerted on a unit volume of polarizable material is then 


where the force is divided by N to represent the force per molecule. The total force can now be written 

2 dX 

1 lore, k the spring constant representing the restoring force of the molecule. 

The equation of motion for the oscillator is then 

= (B.9) 

where ihe natural oscillation frequency £2/ = \/k/ni. With the inclusion of a phenomenological damping term proportional to 
the velocity of vibration, the equation of motion becomes 


d-X(t) dX{t) 


(B.IO) 


which is the equation of motion for a damped harmonic oscillator. Transforming into the temporal frequency domain results in 
the solution 


_ fid 

-Q--/£2y + Q7 X(Q) = — 


:r{£(f) •£(/)} (£ 2 ) 


wot - i !r{E{t)-E{t)}{Q) 

' 2w \ £27-£2—/£2 y 


which gives the expressum for the molecular vibration spectrum. 
Niuv. the material polarization is defined 

where the induced dipole moment of the molecule is 


p(r) = Eoa(X)E(/)^^eo a,) + X(/)^ E{t) 

X=0l 

= e<,fa«-h^-'{X(£2)} ]£(f). 


The nonlinear part of the material polarization due to Raman scattering can be written 

/><'»(,) = eoj-'£( 0 } (ii)} 


(B.ll) 


(B.12) 


(B.13) 


(B.14) 
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Stokes shift n THz 
-40 -20 0 20 40 



l-mure B.l: Real and imaginary parts of the Raman susceptibility for fused silica. Here, y= 10.4 THz = 65.6 rad/ps, Q/ = 
14 2 THz = 89.0 rad/ps. and. at Xf = 1.55 pm, Ro = 2.62 x 10~'VcmVW • ps*. The heavy solid curve is the imaginary part 
and the thin solid curve is the real part. For positive Stokes shift £2, the imaginary part leads to gain, while for negative Stokes 
shift, the imaginary part leads to loss. The dashed curves indicate the three-term Taylor approximation which is valid within 
the range ± 5 THz” The dotted curve is the frequency-domain convolution of a 16.5 fs sech() pulse. 


where the Raman susceptibility takes the form 




ax 


Ro 


0 [ Qj-Q--iQYj £2 t-£2--/£2y' 


(B.15) 


The real and imaginary parts of the Raman susceptibility appropriate for silica [141] are shown in Figure B.l. Note that for 
Q » £2/. Y. the susceptibility 0 because the time-scale of the driving field is too short for the molecule to respond. 

Transforming back into the time domain 

pl^^\i) = ^{R^{t)*[E(t)E(t)]}E(t). (B.16) 


where the Raman respon.se function is defined 


R^{t) = y-' 


Ro 


£27 - £ 2 - - i£lr^ 


Rpe ^'l-s'm{QRt)Q(t) 
Qr 


(B.17) 


and £2 r = ^£lj - y-/4. Written in integral form 

= R^(‘t)E(t-x) E(t-x)E(t)dz. (B.18) 

The Raman response function for silica is shown in Figure B.2. 

B.2 The Raman Gain Spectrum 

The previous section derived the form of the Raman susceptibility and Raman response functions for a single L-orentzian line. 
In typical experimental situations, the Raman susceptibility is mapped out by measuring the Raman gain versus the difference 
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Figure B.2; Raman response function for fused silica: y = 65.6 rad/ps, Qr = 13.2 THz — 82.7 rad/ps, and, at X/ — 1.55 //m, 
R,i = 2.62 >: 10“'* cm-/W ■ ps*. 


frcqucnes betueen two monochromatic beams: the pump beam and the probe (Stokes or anti-Stokes) beam. The probe beam 
e.xperiences e.xponential gain or loss at the expense of the pump. 

This theoretical treatment of Raman gain begins with the scalar nonlinear coupling between two monochromatic plane 

waves tiir simplicity 


E{zJ} = + CC, 


(B.i9) 


such that only the inter-pulse effects of stimulated Raman scattering are treated. 

Since this analysis deals with monochromatic beams, the temporal frequency domain is the natural space in which to derive 
the Raman gain coeflicient. Transforming the electric field then 


£(c.to) = ^-4;,(c)A*5(cD-tUp) -I- '■*>=5((D-l-a)p) (B.20) 

-t- -h ^/l‘(c)e“'‘’'S((o-t-to,), 

and iiint>rini! linear absorption and diffraction, the scalar nonlinear wave equation in the temporal frequency domain is 

l:*(u))£(co)-t-(B-21) 

a:- c- 

+ ^!Ly jx^{ur,Q)E{o)i)E((i>2)E((S)-Q.)duyid(r)2 = 0, 

where Q = Wi -t- OJ; and initially both the electronic Kerr and Raman susceptibilities are considered. 

The nonlinear polarizations of interest are those that drive the wave equation at (Up or (O.,. These polarizations are 


-l-2x^(iD/>; -to,..to,v,t>Jp) Kv|‘l /fpA' 


(B.22) 






(B.23) 
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+2x*^((0,v; —(Op,(Op,COj) |j4p| A^e ^ 


Pyf(a)p) = jjj [^p' 4 p 8 ((Oi +( 0 p) 5((02 - Op) (B. 24 ) 

+ Ap4*5(0] -0p)5(02 +Op) Ap5(o-Q-Op)e'*'’Voirf02 
y /y A*A., 5 (oi+Oi) 5 (o 2 —o.v) 

+ A,vA;S(Wi-0 ,)8(02 + 0,.) Ap5(0-Q-0p)A'^/0i(f02 

+ ^j J ApA* 8 (oi-Op) 5 (o 2 + Oj) 

+ A*Ap 5 (Oi+ 0 .,) 8 ( 02 - 0 p) A,v 8 (o-Q-o,)e'*'’’(/oi (/02 

Pp(co,) = AM.v5(oi+o,,)5(o2-o,,) (B.25) 

+ A,A*6(Oi - o,) 8(02 + o,v) ] A,8(o- Q - o,v)c'*''£l0i<i02 
+ j/yx'^(o;£ 2 ) A),Ap5(0|+0p)5(02-0p) 

+ ApA* 5 (oi-0p)5(02 + 0p) A,v8(o-Q-o,^)e'*'■(/Ol^/o2 
+ ^J JX^{^'<^) A,A* 5 (oi-o,v)8(02 + Op) 

+A*A.,8(oi +Op)8(02-o.v) j Ap8(o-Q-Op)e'*’'(/Oi(fo2, 

where the intrinsic symmetry property was used in the instantaneous polarization and the four-wave mixing terms (2(0^ — co.v, 
etc) are neglected. Evaluation of the Raman susceptibility integrals results in 


/Vf(a)y^) ^.v)] |^.v| 1^/7^ 

P«(o,) = ^{x'^(Wv; 0 ) lA,P+[x'^(“>; 0 )+x‘^(«v;o,v-Op)] |Ap|-|A,vc'^''. 


(B.26) 

(B.27) 


Definme the nonlinear indices 




where Q = o)^, - (o,, and making the slowly-varying amplitude approximation (SVAA) 

.. -,ji, ' 


(B.28) 


(B.29) 


(B.30) 


the coupled nonlinear evolution equations become 


3 : ’’ /!(Op) 


/iR(Op.O) 


^ »)/;(Op, 0 ) + /l/?(Op 


+ 2 + 


«/(( 0 )pj 


\AA-y 


+ /-;^/m{x'^(Op;-Op.Op,Op)} |Ap|‘Ap 

4c*' 


+ (^ 2///i{x'^(Op;-o,,o,,Op)} + ^//«{x’^(tOp;Q)} |A,.|-Ap = 0 

4c“ [ J 


(B.31) 
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dz n(o)s) 


+ 2 + 


/ij?(a), , 0 ) ] I .2 

' uM J ' 

, ^ /l/;(COf 0 )+/lj?(CO.t,-fi) 
UK ((Os) 


(B.32) 


\^p\^ Ma- 


+ i^Ini {x'^(co,;-(O,.C 0 ,v.a),v)} 

+ /^ 2/;H{x'^(a),;-tOp,(i)p,a)i)}+|/m{x’^(co,;-Q)} |4p|-A.v = 0, 


where Im {x*'} corresponds to self- and cross-two-photon absorption and is always negative, and the Raman gain/loss suscep¬ 
tibility component given by /;«{x’^(co;0)} = 0 by (anti) symmetry. Note that now the ratio between the cross- and self-phase 
modulation coefficients no longer equals 2 in general, a fact previously noted [238]. 

Considering only the imaginary part of the Raman susceptibility responsible for Raman gain for simplicity, the coupled 
equations can be rewritten 




dAs _ 

Bz An 


3 ///i{x'^(co,;£ 2 )} 


where = =/)(co,v) and//;i{x'^(-i2)} =-/"I from equation B.60. 

.MultipK mu equation B.33 by Ap and adding to the conjugate of equation B.33 multiplied by Ap results in 


3|/4p| kp 


3 /»! { x"^ (tOp ; } \As P |/4p I ■. 


(B.33) 


(B.34) 


(B.35) 


DolininL* the optical inlcnsily 


equal ion H 35 can be rewritten 


/(O)) = |/1(m)P, 


’—I'n{x^{(np;Q)}lslp. 


(B.36) 


(B.37) 


(B.38) 


d: e{)cn-pn, 

Perlt^mnny the same steps with equation B.34 results in 

= -A- I»>{'A(Os-,Q) } /p/,. (B.38) 

d; tQcn;np 

.Adding these two equations in order to obtain the evolution of the total optical intensity indicates that intensity is not conserved. 
Since the interacting lields are monochromatic plane waves, this directly implies that optical energy is not conserved, because 
energ\ is supplied to molecular oscillations, then dissipated through the damping term. 

What Is conserved by this system of equations is total photon number. Defining the photon flux 


e(,r;i(ti)) 2 

rb(c)) = —-|/ 1 (( 0 )| 

Jio) 


the coupled equations for the evolution of the photon flux are 


^ Im ix^tOptO)} <t>s<t>r 

^ In, {x^tOv.Q)} Op<l), 


(B.39) 


(B.40) 


(B.41) 


w here it is apparent that total photon number is conserved. ^ ^ 

Now. back to the original purpose of defining the Raman gain coefficient. Typically, when |4p|' > the depletion of 
the pump is negligible, reducing the coupled equations to just 

^^^^_In,{x'^{(ns-,£l)}\Ap(0)\-As{z), 


(B.42) 
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where the z dependence has been restored. Now equation B.42 can be easily solved 


where the Raman gain is defined as 




G/i = |Ap|- = gfilp- 


The quantity typically reported in the literature as the Raman gain constant is, from equation B.38, 


gR= -^///i{x'^(Wv;£2)}, 

eocn-Hp 


(B.43) 


(B.44) 


(B.45) 


and has units of cm/W. For example, in silica-core fiber, a gain constant of gs = 1.86 x 10 " cmAV was measured for the peak 
Stokes shift of 13.2 THz (440 cm~') with a pump wavelength of 532 nm [141], and in Alo 24 Gao 76 As slab waveguide, a gain 
constant of = 7 x 10“* cm/W was measured for a peak shift of 8.4 THz (280 cm ’) with pump wavelength 515 nm [151]. 
For the Raman susceptibility given by equation B.15. the real and imaginary parts are 




Ro [q}-Q- 


//n{x'^(n)} = 


(B.46) 


(B.47) 




. 1 ) that the Raman gain constant can be written 


EoC/l«'7, jQ:_Q:]'q.Q2 


(B.48) 


The peak ol the imaginary part occurs at the frequency 


f2m;ix — 


2 i 25 -Y-^ •f‘-4Y-£2^--f 16Qj 


(B.49) 


For weak damping Y finux- the frequency at peak gain Q„ux £2/. and the line width ol the gain % y. It is useful to 

obtain an expression tor Q j . Solving equation B.49 then. 


Qy — QiiuxY' 


(B.50) 


so that £2, can be determined when the line width Y and peak down-shift £2„u.x are known, which can be easily obtained from 
experimental data. 

The resonance approximation to equation B.15 is common in the literature. Taking Q.j and after a series of straightfor^ 
ward manipulations, the susceptibility reduces to 


with real and imaginary parts 


2[£2y-£2]-/Y’ 


4[%-a] -fY- 


I>n{y^(Q)] 


. ^oY/Q 
4 [£2/ - Q] ' -I- Y' 


(B.52) 


(B.53) 
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I-igure B.3: Raman gain and index spectra for fused silica, calculated from the data in Figure B.l. The Raman index is defined 
hv equation B.29, and the Raman gain constant is defined by equation B.45. 


Now. the Raman gain coefficient for the Stokes beam becomes 

ep(i2) R:-T -- tt-. 

£ocn,/t- [£2y-Q]' + Y-/4 


(B.54) 


The Stokes down-shift frequency that experiences maximum gain is then given by 

4Q/+J4Q^-3y= 

O ^- 


(B.55) 


Note frv>m equation B.15 that Rq has units of m-/V- • s^ In the International System (SO of units, the free-space dielectric 
constant has the value f,) = 8.854 x 10"'- C'/N m^ which also has units F/m, or W s/V- m. For silica fiber. Rq = 5.03 x 
l()“''cnr V- s- which for = 1.55 ;um, equals 2.62 X 10"'-cm-/W ps-. 


B.3 The Raman Susceptibility 

Using the same steps as used for the linear material polarization in section 3.1.1, the Raman response integral is transformed 
into The temporal frequency domain. The frequency-domain representation is approximated for finite bandwidth excitation and 
transformed back into the time domain in a form that can be easily incorporated into the theory and numerics of subsequent 
chapters. This result is compared to that obtained directly from the Taylor-expansion of the time-domain form of equation B.l. 
Finally, using the results of Appendix A. the third-order polarization is reduced to a form suitable for the isotropic and cubic 
43m symmetry cla.sses. Section A.2 discusses the intrinsic permutation symmetry property of the fully dispersive case and the 
reduction to the Raman response, and the spatial symmetry properties of the chosen classes as they relate to 

Writing the driving fields in their Fourier-domain representation, the Raman response of equation B.l becomes 

= l%m(^'’^)Ek(o^\)Ei((i32)E„,{t)e~‘^d(3)]d(s)2, 


(B.56) 
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where Q = 0 )| +(03 is the Stokes (Q < 0) or anti-Stokes (Q > 0) frequency and the third-‘order Raman nonlinear susceptibility 


is defined 


X%iJmQ) = jem%i,,,(T)e‘^dT 


(B.57) 


and (~^) = Xyr* m(^) because the Raman response function is real. It is evident that the Raman susceptibility is subject 
to the^^ramers-Kronig relation discussed in the context of the linear susceptibility. 

The imaginary part of the susceptibility Ini where Xr(S^) = 5le{x'^(£2)} + i7»i {x''(i^)}. is proportional to the 

Raman gain, which is typically the quantity measured in experiments, as discussed in the previous section. Therefore, it is 
useful to have a relationship between the imaginary part of the Raman susceptibility and the Raman response function. The 
inverse Fourier transform of equation B.57 results in 




(B.58) 


Through the Kramers-Kronig relation, the real part of the Raman susceptibility can be determined from the imaginary part 
and the response function evaluated by relation B.58. Alternatively, the Raman response function can be determined directly 
from Ini {x’^(f2)}. Since the Raman response function is real, by equation B.57, the real part of the Raman susceptibility is 
symmetric and the imaginary part anti-symmetric. Note that equation B.58 can be rewritten 


?*^(t)©(t)= J [l^e{x'^(to;Q)}+/7w{x'^(t<J;^i)}] [cos(QT)-/sin(QT)]i 
= j!K.e{x^((i)-,Q)} cos[Qx)dQ-^- J 7«i{x'^((i);Q)} sin(QT)^fQ, 


(B.59) 


where the anti-symmetric terms integrate to zero. Because the left-hand side of the equation is zero when T < 0, it must be true 
that the two integrals on the right-hand side are equal and opposite when t < 0, leading to the relation 


r'^(t)0(t) = 2 j 9le{x^{t)>\^)}cos(Q.z)dQ. 

= 2 j Im{x^((n-,Q)}sin{Q.x)d^, 


(B.60) 


when T > 0. 

The nonlinear polarization is now written in the temporal frequency domain 

= eo/)£/(«;) I 

= to j j (B.61) 

Using the f-ourier-domain wavepacket representation and keeping only the terms producing a polarization near coq, e.g. the 
terms responsible for nonlinear refraction and nonlinear gain/ab.sorption, the polarization becomes 

M X (B.62) 

[A]J(a)i-t-ti)o)/4/((l)3 ——^~®o) + 

Ai((i)| - Wo);4*(a)2 + tOo)'4m(tt>-Q-coo) + 

A*(a)| -a)o)/4;(co2-cooMm(t>^-^ + ‘t^o)]<ftOi<7a)2> 

where j.k.I.me {-v.y,;}. The Raman response is typically at low frequencies (when compared to optical frequencies), for 
example, the peak of the Raman gain in fused silica is at 13.2 THz. Therefore the strongest response will occur when Q is near 
that peak. For the last term of equation B.62, Q % 2(Oo and X/?(2a)o) ss 0 from section B.l. 

In order to arrive at a simpler time-domain representation, the Raman susceptibility tensor of equation B.62 is Taylor- 
expanded about Q = 0. The Taylor expansion is 

Xyrt«.(“o;f 2 ) = X;r/«,(“o; 0 ) 


(B.63) 
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+ [o)i ±a)o] 


^xWtOo;( 0 |±mo) 


ltOl==Fa)o 


, , 9x"|,„(<Oo;TOJo + 012) 

+l“=T 01 ol -- 


I(i )2 = ±t0o 


Using these expansions, the integral of the Raman susceptibility is approximated as 

pg+(o)) = ^//{x'^ + [o)i+o)o]xf+ [® 2 -o>o]xf+ •••} 


Al ( 0)1 + a)o)'4/(W2 - Wo)A„,(r, (o - £2 - ( 0 o)da)i£?(i )2 

jJ I {x'^ + [a)i-coo]xf+ [(02 + coo]x2' + --}x 

A*(coi - coo)A,’ (C 02 + too)A„,(o)- £2 - (Oo)d(Oido )2 


where the following shorthand notation is used 

= Xy*/„,(“o;0) 


^X%,,(ft)o;cOi -coq) _ 9x%,(t0o;-«>o + t02) 


(B.64) 


(B.65) 


(B. 66 ) 


R. _ ^X%,(t»>o-Q)i+Mo) _ 9xV,(tOo;tOo + co2) 


Ia> 2 =-t 0 o 


(B.67) 


Now equation B.64 can be written in the time-domain. For example, inverse-transforming the first integral of equation B.64 

= yX'^///'4l(Wi+o)o)A/(a)2-Wo) X 

-hyX^ y j y[wi+W o]a;(o)!-(- a)o)AHw2-Wo) X 

i4,„(C0 — ( 1)1 — 6 ) 2 (Do) 1 ^ 0 ) 2 ^ 


yx'^y y j An,((o')e '“''</0)'Aj.(0)i-t-0)o)e '"'I'r/cOi x 

A/(o) 2 -COo)e“'‘'’-'^/o) 2 e“'‘^' 

^X?'/y y^i(a)')c-'‘'’''dm'[m,-hmo]A;(Wi-hO)o)e-'“''rfco 
>4/ ( 0)2 ““ -j- . . . 






Using equation B.57» the coefficients 


xf = X 2 ' = ' y T©(t)Ryii,„(t)dX = X^'- 


(B. 68 ) 


(B.69) 


Note that x*^' is purely imaginary because it is proportional to the slope of the Raman gain at zero frequency. Now the first term 
of the Raman polarization can be written 


fW* = I + • ] A.,!,).-'-' 


= TZ;«,Nile's: [A;(.M,(,)]A,(»e-'*»', 


(BJO) 
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where the derivative argument acts on the quantity in brackets. Combining this expression with that obtained for the second 
integral in equation B.64 results in 




(B.71) 


Here, the function of the operator is interpreted by a Taylor expansion about zero frequency. 

An alternative approach is to start with the time-domain convolutional representation given by equation 3.18, rewritten here 

in component form 








Defining j = / - t, the delayed fields can be Taylor-expanded about / 

^ [-5-^]“ 9"'4(5) 

/l(5) =/l(r) + [5-t] 

_a/l(r) T- d-A(f) 

_4(f) 3^ a, + 2 3,2 ■ 

Substituting into the convolution integral results in 




(B.72) 


(B.73) 


(B.74) 


Using equation B.57. it is clear that equations B.71 and B.74 are the same. Defining the constant 




equations B.71 and B.74 can be simplified to 


^ 7 -r (/)! (fV"'"’"' 


- f|x'^'Vo;0) 


d-A:{t)A,(t) , a-/li(/)/i;(r) 
dt- dt- 




(B.75) 


(B.76) 


The first-order term in the Raman expansion is a linear approximation to the Raman gain/loss curve. The second-order term is 
a parabolic approximation to the dispersion of the Raman mmlinear refractive index. The next correction tor the Raman gain is 
third-order cubic as evident by the fact that the imaginary part of the susceptibility is anti-symmetric. 

The total induced third-order polarization including the instantaneous and Raman contributions can then be written 


(^) = J { x5i;m(“o; -(0().0)0. 0}o)Al{t)Ai{t)A„,(t) 

+x5i/m(o)o;o)o. -(Oo.a)o)/4*(f)A; (r)A„,(r) 

+Xj*/m(o)o; 0 )(). OJO.-a)o)-4, (/)4, (r )A‘, (r)-f 

+Xyt/m(“o;0) A;,(r)4,(f) -f/l,.(r)/4;(r)] A„,(r)f 


(B.77) 


Eo_r \dAl(t)A,{t) , aA,(r)A;(r) 


- —r 


A„,{t)e' 


dt ' dt \ 
ra-Ar.(/M/(/) d-A,ii)AUty 
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Using the spatial and intrinsic symmetry reductions from section A.2, equation B.77 becomes 


= ^l^xfjjj(oio<-(Oo-Oio^c>yo)Aj(t)Aj(t)Aj(t) 

+2Xjuj(0)o-. -coo.coo,cooM*(fM*(0'4;(f) 

+7 {[ Xy;7>(®o;0)A}(r)A;(t) +x^ii/(Do;0)A;f(r)Ai(r)] Ajit) 
+Z>>u(“o;0)[A;j(/)Aj(t)+A}(r)Aar)]A;.(f)}e-'‘^^ 

eo fU 3^*( 0^(0 R 

- j I l^jw — dt — dt J 




aA;.(/)A;(r) dAj(t}Mf) 
dr ^ at 


8 


p» d-Aj(r)Aj{r) ^ 8-A;^(/)A^(t) 

^jjjj 9,: 3-Xjkkj 0^2 



’a=A;,(t)A^(/) a-Aj(t)A*(t)‘ 

dt- + at- 


Aii(t) 




(B.78) 


which i> the form used in the multiple-scales analysis. 




Appendix C 


Derivation of the (2+l)-D Vector Nonlinear 
Schrodinger Equation for a Slab Waveguide 


This appendix reduces the full (3+l)-D NLS evolution equation for nonlinear spatio-temporal propagation in homogeneous 
media to the (2+1 )-D equation appropriate for propagation in a planar waveguide structure that supports TE and TM modes. The 
ke\ to this derivation is the use of separation of variables in which the functional form of the one-dimensional linearly-guided 
profile is separated from the envelope in the unguided dimensions. To lowest order (sec. C.l), this separation of variables results 
in the well-known effective index approximation, which reduces the dimension of the problem. At higher orders, the concept of 
cflective index is extended to the linear dispersive (secs. C.2 and C.3) and nonlinear terms (sec. C.3). The resulting nonlinear 
wave equation is the (2-1-1 )-D reduced version of the (3-1-1 )-D NLS equation (with the transverse dimension y removed), with 
effective values of the material parameters which represent the influence of the transversely guided modes. 

Section D.2 i»f Appendix D evaluates the effective linear dispersion coefficients for a generic three-layer slab waveguide 
structure. The results of this analysis for typical waveguide parameters justify certain approximations made in this multiple- 
.scales derivation in order to obtain an NLS equation for the waveguide modes. 

The waveguide geometry is shown in Figure C.l. By symmetry, the waveguide supports two sets of modes, transverse 
electric (TEi and transverse magnetic (TM), which are ot orthogonal linear polarizations. Thus, the use ot the Cartesian 
polarization basis set is not arbitrary. The initial electric and magnetic field envelopes are written 


A, =xUr- (^y.(0„+ 4f(X,Z,.7)c'P'' '' 

+ y f v. 0),, + ^ {X. Z,. T)c'P™^ 






B]^'(X.Z,.T)e' 






(C.l) 


(C.2) 


such that the interaction between waveguide modes is handled explicitly by the choice of slowly-varying envelopes and 
Bj'*'. The waveguide mode profiles and are unitless with the components Uj^ and normalized to unity. 
Therefore, the envelopes a]^ and B™ have units V/m. while and have units of A/m. The constants 

r)’’'’'- = po<A)/ 3™ and ri™ = represent the material impedance seen by the TE and TM modes [239]. These 

impedances have units V/A = Ohms. 

The frequency-dependence of U and V is included to describe the variation of guided profiles (and effective index) with 
wavelength, which simplifies the analysis [ 194|. Note that the w-aveguide modes do not have z variation. This approximation 
fixes the modes such that they cannot change with propagation and is valid when the induced nonlinear index is small compared 
to the linear index variation of the waveguide, which is the case assumed here. In fact, typical nonlinear index changes are on the 
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J-i>:ure C.l Waveguide geometry used for (2+l)-D derivation. The linear refractive index varies along the v-direction, which 
gives rise to distinct .sets of TE (electric field polarized along ,v) and TM (magnetic field polarized along .v) modes. 


order of lO"'- lO”"*. while weakly guiding waveguide structures have A/i ~ 10 -^—10 -. Only one TE and one TM mode are 
considered and are assumed to be fundamental modes (no zero crossings of the field), but this assumption is not a requirement 
tor the general case derived here. Extension to multiple, higher-order modes, is straightforward [991. but not necessary in this 
thesis. 

Ciuided modes typically have profile variations on the order of a few wavelengths, therefore y is taken to be a fast variable 
such that ^ r -s ^ 'x T 


Because of this change. Maxwell's equations need to be suitably modified. The modification of Faraday's and Ampere’s Laws 
IS slraiiihlfi'ruard: 


3v 19c 9Z, 


- + + ' = '>0 + fir 


9a V 9a^ . r . . ^ 1 o 

'ax-17 = '" “”+"dr 
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a a 1-5 dB, 

— h S-T—-I-■ ■ I Bx £ 


‘ 3; az, 


. aBv dBx 
^dx dy 


^ 1 \ L f \ c ^ \~A J- ^ ■p(NL)+ 

-iSo coo + /e— U coq + j Ay + —Fy 


U/\ ijy 

' 9 L I • ^ ^ T I ^ ■^NL)+ 

-/eo + e 

Gauss’s Law, on the other hand, is not quite so straightforward because the dielectric constant now has variation in the v 
direction. This change is 

iur„„^ 4 .»i-'ixi (CIO) 


(,^■“”+''5?j'''J 

+j‘-/,,C0„ + .e|=) js^+ [|+8^+ = 


2/ 3^""’* 

Eo T ^ 




Finallv. the magnetic field divergence equation is 


dB^ dBx raB; dB. ] - 

634 + 3-+ ^ + e?r + '" =°- 

dX dy [ dz dZ^ 

With these chances from the (3-rl)-D derivation noted, the waveguide derivation can begin. 


(C.ll) 


C.l Order e - Effective Indices and Phase Velocity 


At the lowest order. Maxwell's equations result in 


0y4 1 - 0/4 1 V 

(TM) A': = /poWo^iA 

(C.12) 

0^4 

(TE) V : = /poWo^i.v 

(C.13) 

0/4 1 V -B 

(TE) c: = -/poCOoBi;. 

(C.14) 

which allows ihe elecinc fields to be determined from the magnetic fields. 


(TE) .V; =-/eoWoE^(.v,(Oo)Au 

(C.15) 

dB 

(TM) y; -^ =-/EoCOoE^(y,tOoMi.v 

(C.16) 

(TM) c: =iEoa)oE!^(y,(i)o)Au, 

(C.17) 

which allows the magnetic fields to be determined from the electric fields, and 


ainelj (v.oX)) - aAiv aJi- ' 

(TM) % .4,,+ + 3 .--O 

(C.18) 

dBiy 0 B 1 - 

(TE) 3 ; + 3 ;-=o. 

(C.19) 

which relate the v and c polarization components and offer no additional information. To this order, all six field components are 
nonzero. Equations C.12, C.16, C.17 and C.18 determine the TM mode and equations C.13, C.14, C.15, and C.19 determine 


the TE mode. 
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TE Mode 


Working first with the TE mode, taking the : derivative of equation C.13 and substituting equation C.15 results in 


Substituting equation C.14 for Bi;, 


ipo^ ti5o)^i.v 


-= WWo -+ /eo(Ooe^(>',o)oMi, 

■■ ^ dy- 


(C.20) 


(C.21) 


which, upon rearrangement, yields the eigenvalue equation 

^ = (C.22) 

Equation C.22 is a function of only the fast variabley and determines the TE guided profile Uj^{y, COo) and effective propagation 
constant (3^^. Since the envelope T) cancels, equation C.22 can be written as 






(C.23) 


This equation is most appropriate for waveguide structures with continuous variation of the refractive index, but in general, is 
not sufficient to determine the guided profile [240]. The constraint provided by the finite one-dimensional power of the guided 
mode provides the necessary additional information. 

For waveguide with discontinuous, or step, variations in refractive index, equation C.23 can be solved in each homogeneous 
reeion and linked through the appropriate boundary conditions. These boundary conditions are the continuity of the tangential 
electric and magnetic fields across the dielectric interfaces. For the TE mode, Ajc and B- are continuous, and by equation C.14, 
d \, /d\ IS continuous [239]. Therefore, the boundary conditions are given by the continuity of 


(//^(y.coo) and 




(C.24) 


Determination of the magnetic field components of the TE mode. t/7^(y. coq) and f/J^(y,(Oo), is not necessary since the wave 
equations will he developed for the vector electric field. 

For a known index and guided amplitude profile, the dispersion relation can be written as an overlap integral. This can be 
.seen by multiplying equation C.23 by and integrating, resulting in 


FtTEl- _ 






(C.25) 


TM Mode 

The T.M guided profile is mo.st easily obtained using the magnetic field. Taking the z derivative of equation C. 16 and substituting 
equation C. 12 results in ^ _ 

= /eocooe!;^ (y. tOo) - -f t>o«oS i tl • {C.26) 


Substituting equation C. 17 for A 


roTMi^fi _ Slne^jy^^ 

-[P J ®iv-- o,, a,. 


-eo^oMoE‘^(>’.Wo)Su. 


which, upon rearrangement, yields the eigenvalue equation 

dy^Hy-iUo) 9lne!^(y,a)o) ^ 

3y- 3y . 3y 

{[p™]--/:-(y,a)o)}v7M(y,a)o), 


(C.27) 


(C.28) 
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which again is most appropriate for continuous index variation. 

For step variations, the boundary conditions for the TM mode are the continuity of Bx and A- across any dielectric interface. 

Using equation C. 17 for A; leads to the continuity of 


vJ'^(y,(Oo) and 


1 


(C.29) 


In order to facilitate the derivation of the TE and TM nonlinear wave equations, the y and 5 components of the electric field 
associated with the TM mode must be obtained. From equation C.16 


V'v™(y,coo) = - 


f P™ 1 I/TM/. 


V™(y,Oio) 


Alv.coo) 




(C.30) 


which determines the y component, and from equation C. 17 


V7’^(y,coo) = 


q™e(,cooe!^(y,coo) 

1(3'^'^^ dF™(y,(Oo) 
A-(y.a)o) By 


(C.31) 


which determines the c component. These expressions are fully non-paraxial in the guided dimension because of the choice of 
V as a fast variable. 

Note that the ; profile of equation C.31 depends on a transverse derivative. The major contribution to this transverse 
derivative is the transverse wavenumber. When the transverse wavenumber is small (as is the case for low-order modes in a 
stronuly-guiding structure, or for modes in a weakly-guiding structure), the mode angular spectrum remains close to the j-axis, 
meanmg that « I'™, which is the paraxial condition. Even though the paraxial condition is imposed on the unguided 
envelopes .A and B bv the choice of the slow transverse variable X. it does not need to be imposed on the guided profiles U and 
I' which depend on the fast variable y. Later in the derivation (sec. C.3), though, this condition will be imposed in order to 
simplify the nonlinear polarization. 

The dispersion relation for the TM mode can be obtained in a similar manner to that for the TE mode, with the result 


/A-(y.(Uo)[''.™]Xv-/ 




/[FTNfrfy 


(C.32) 


The eigenvalue equation C.28 can be recast into a more familiar form in terms of the y-component of the electric field. 
Substituiing e!^. (y.coo)k',™(.'-Wo) into C.28 results in 1159] 


3-\’.™(v.a)o) dlnek (v.oo) 9v;’''‘'(y.a)o) 
dv: " -^ 


(C.33) 


^'ln|^':^ \f^'(y.o)u) = {[P™]--A-(y.tOo)}v7^’(.v,a)o). 


C.2 Order E- - Group Velocity 

Similar to the (3+l)-D derivation, this order determines the group delay of the TE and TM envelopes, except that now the 
group-delay has contributions from both the material and waveguide dispersions. The waveguide dispersion is responsible 
for the difference in TE and TM group delays, which can be tailored somewhat by the choice of waveguide parameters. In a 
weakly-guiding structure, the group delays will be nearly equal when the material is linearly isotropic as is the case here. 

Another difference from the (3-i-l)-D derivation is the inclusion of the frequency-dependence of the waveguide mode [194]. 
The first-order dispersion of the waveguide mode is included as 


+ (.v.coo) 


dAj^(X,Zi,T) .pTC, 


(C.34) 
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where Azx is the second term in the electric field expansion and is composed of an unknown component Ai, and the first term 
in the electric field expansion obtained at the previous order which now includes dispersion of the waveguide profile. The other 
components of the electric and magnetic fields are handled similarly. 

From Maxwell's equations then, Faraday’s Law results in 


dA-'- dA2\ 3Aiv . -= ^B\x 

(TM),: = 

, ^ D 

3Aiv dA^x D 

-53^--^ = WCOoS:,-/4)-^, 


while the modified Ampere’s Law gives 

dB2z 9^1 V 


OD^- ua->Y I / \T 

(TE) jc: 


+£o £ 


4(y,CDo) + cooel{(y,coo)] 


dB'^x 3^1 V 3^1 - I r s-r 

(TE) y: = -/eo(0oe^(>’,o)o)>l2y 


+eo Ej^ 


dBjy dB'>x I . x~A 

(TM) f: -^--^ = -/£oa)()E;^(y.a)o)A2, 


I inalK. ihc divergence laws give 


ae!^(y,o)o) , 


(y,coo) + woe^(v,®o) ^ 


+eo E!^(y,a}o) + (Ooe^(y,o)o) 


3^1 V 3/\2v ^Azz 3/4 1 ; 


— I 

/i:.+£«(v.«.) •5F+-9r+'3r+ 3z; 




9e!;(y.too)34u 



dy 

dT 

3fii ^ 

35'' 

V SBw 

aSi- 


‘^“3^ 

"^“aT 

■^az; 


■ I', ^ 3 5Aiv , 3Ai; 

+ 4(y,o^,)j^ -^ + -jr =0 


(C.35) 

(C.36) 

(C.37) 

(C.38) 

(C.39) 

(C.40) 


(C.41) 


(C.42) 


ri: Mode 

Now the cn\ elope equation for the TE mode is derived. Taking the :-derivative of equation C.36, and substituting equation C.38 
and the\ -derivalivc of equation C.37, results in ' 

3“/4''i 3"/42 i I -- 3“/4i; 3 A[y /p 

— 4 ---^EIp v.tiXi / 4 v = (C. 43 ) 


a-A^. 

1 . 


a-Ai, 

a-A,, 

+ - 5 ^ + 
dy- 


. 0 ) 1 ,) A 

“ a^az ^ 

a.vax 

•oTE^ 

'P' az, 

f OX) 

- - 

C" 

z\^{y. 

C)o) + O)oe^(. 

.v,o)o) 

a=Bi, 


asiv 

a^B.v 


■^■^“avaT- 


az, 

dzdT ■ 



After subsiiiuiing from order £, the TE mode equation reduces to 

d'Azx d-Azx W(1 1 , 

3 -^ + -4!-z\j.(y,iOQ)Azx ^ 
az' 3v“ r“ ^ 


“"'P aZ| 37 J dy dX ' 


(C.44) 
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Using C.34, equation C.44 can be written 


0)q I —7 

-^+ -^ +(V. 


(C.45) 


- i [{[ii’^]=-i-(v.c«o)}(/;'='(,,(o„) - ^= 


,[oTe3A.« . . 


-2/p^+K)’.(Oo)*'(y.«>o)-5^ 


dy dX ■ 


The left-hand side can be reduced by taking a frequency derivative of equation C.23, 


{[pTE]=-(■=(,, <00)} 1/5 (y ox,) - 


(C.46) 


A { [3TE] 2 _ I ^TE(, 


which, upon substitution, results in 




{CAD 


.je^Au , ...glneL (y.oo) 8^,,. 


-2/ pT‘^^ + A(y.o),,)A'(y,a)o)^ 


dv dX 


_ 

O■ RTF, \dAu OTK'^1 _ ^ 

— --./p "tP :a 7- ;iy ' 


cir J a.v ax • 

Equatii>n C.47 is in the form Lit = / where L is an operator and / is the forcing function. By the'Fredholm alternative 
theorem 1194). this equation has a solution if and only if (nj,./) = 0. where //;, is the homogeneous solution, i.e. Liih = 0. The 
homogeneous solution ft>r4o, is just the TE mode so that oc In order to guarantee a'unique solution then. 


[(/ 7 /;|y<o.))’</.+P"r-^ / [t' 5 (y»»i))’<'y 


(C.48) 


which can be written in the simpler form 

_ _ oTh' 

dZ, ~ ^ dT 


(C.49) 


.35™ ,[rtni_ nn.|■ .1 ^ti)()) Uj^jy. cou)[aIn e!,^ (y. cop)/a.vKv 

ax 26’^^/[t/7‘"'(.'’.Wo)]'ay 


Note that the TE-mode group-delay is defined as 

= (C.50) 

dojo 

which can be evaluated from the eigenvalue equation C.23 or the TE dispersion relation C.25. The TE-mode group velocity is 
defined as vj*"' = 1/P^*^ • 

The extra term in equation C.49 indicates linear coupling between the TM and TE modes due to the variation of the TM 
mode along the .v direction, thus serving as a source term for the TE mode. This effect arises because of the differing rates of 
diffraction on either side of a dielectric interface and does not occur in homogeneous regions; in the (3+1 )-D case, the dielectric 
“interface" is due to nonlinearity, which occurs at order e\ and this term appears with the vectorial nonlinear couplings at order 
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Since the TE mode travels with the natural phase delay and the extra component generated by the TM mode is driven 
with the phase delay P™, there is a coherence length. Lcoh = [P^"^ - P'^^]"', over which the effects of this term are important. 
It is assumed that the propagation lengths of ultimate interest (~ Z:) are much longer than this coherence length (~ Zi) so that 
the generation of a new TE component averages to zero, and the term can be neglected. 


TM Mode 

Performing the same steps with the TM mode results in 


d-Bn, d-Bz, ^^k(y-^^dBzr , ,2/ „ ,-5 

r , dSul 3lnek(y,coo) 3Siv 

_2,- 

a raine^jy^l d-Bu 

d(Oo dy dydT 


(C.51) 


Taking a frequency derivative of equation C.28 allows C.51 to be written 

d~B', a-sV dB\ ,-t 


3 rr„TM,: . : 8 a-g|., 




3(0o 9^ dydT 


^ ,U. ... . 


2/[p™:^ + il-(y.to„)l'(y.co«):^ 
a [3lnEij^(y.(i)o)] d-Bu 


dv dX 


(C.52) 


,TM r , rtm' dBu] d^nz.j.{y.(no) 3 fl,, 


^ i .r o 1 M ’ M 

az, ^ dT 


dv dX ■ 


.Again using the I-rcdholm alternative theorem, equation C.52 is written in group-delay form 


_ _ pTM' 

az, ~ dT 


3/1 -3™]- * (.''• t Uu) ^^' ^0) /]^y 

2pTN'i/[Kt™(.v,a)o)]"<v ■ 


(C.53) 


The cfleetive T.M-mi>de group-delay coefficient is 


_ ap^'' 


(C.54) 


where P^'' is obtained from equation C.28 or the TM dispersion relation C.32. 


C.3 Order E^ - Coupled Nonlinear Schrodinger Equations 

The nonlinear Schrodinger equation appears at this order for both the TE and TM modes. The difference between the re¬ 
sults obtained here and those obtained in the (3-t-l)-D derivation is the appearance of effective material constants representing 
diffraction and nonlinearity. Because of the linear coupling between the TE and TM modes due to transverse index variation, 
diffraction in the unguided transverse direction is modified slightly from the linear homogeneous case. Note that this effect 
does not show up in purely scalar analysis [ 194). but, except in the strongly-guided case, can be neglected nonetheless. The 
effective nonlinearity is given by modal overlap integrals of the material nonlinear coefficient with the appropriate waveguide 
profiles for .self- and cro.ss-phase modulation. 
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At this order, the second derivative with frequency of the mode profiles associated with A i and Si must be included, along 
with the first derivative with frequency of the mode profiles associated with At and St: 




Again, A^^ is the unknown component of the electric field. 
Now, Maxwell’s curl equations result in 


(TE) y : 


3/43; 

3/43v 

3/4 2v 

8 a ly 

_ 

8 Bt, 

dy 

8 c 

“ 8Z, 

“ 8Z3 ■ 

= /jUoOJoPS-t 

-Po 


3A2v 

8 Aiv 

8 A 2 ; 


8 Btv 

dz 

8^ 

-1 - - 

8Z, 


= />o“oP3y 

^87 

3/4 2v 

8A3t 



8 B 2 , 


8A- 

““87 

= />0W0O3; - fJQ 

'W' 



(C.55) 


(C.56) 

(C.57) 

(C.58) 


8y 8; 8Z, 82, -V u u; 

+Eo (,v. (Do) + (>'. “o) ] - 2*cooP|f 

r I', \ . “o_i"/ .. ^'^ix 


(TE) i 


+/Eo[£^(y.coo) + fE^^'(y,coo)J^ 

dBxr 8Stj 86|, 86t. l 

[ 1 I ' \ 1 ^ (3) 

-hcoo£j^(v,coo) J - 2 /a)oPj;. 

r I' / \ ^0 1 " / xi iv 

-f/Eo + (y-^o)J ^j2 

dB-iy 8Bi, I , 

(TM) f: = -/£oto,)e^(y.u)o)A.-,; 

+eo eij^(y.(Oo) + (Ooe!^( y. coo) ] ^ - 2 /a)oP[f 
'll , (i)() I // 1 d~A I- 

where ihe third-order polarizations are written, for example, 

= eo« 0 'i:( v) {[ |AI-+2A |A „ 1-+2A |A „|-] A ,.,+7 [aI.+A^ A^} 


I 3-^1: 

I 87- ' 


The Maxwell divergence equations give 


dA-^y 3/4 -»r 3^3- 3/4-)^ 3/4 1- 

3v 3X 3c 3Z, 3Z. 


36!n(v, coq)— , 3/43/4 3 A 3 - 3/42- 

8 Ek’(v,a)o) 8 A-.^. 8 8 A->v 3Ait 8 A 2 ; 

[■ar'"l7 + -ar- 

1 8Ey'(y,tOo) 8 -Aiv 1 L", \ [3^1v , 3^1: 

-2-"V-a!=r-54(.«.‘»»)§f: -ar+-ar = 


+'^^-ar+'=*'''-"“’aT IT'" «r + -ar+aJr 


(C.59) 


(C.60) 


(C.61) 


(C.62) 


{C.63) 
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Eo dy Eo 3; 


, SBjy 903; 3fi2; 


(C.64) 


TE Mode 


As before, taking the -derivative of equation C.57, and substituting equation C.59 and the y-derivative of equation C.58, results 




d^^2z 


9“^2v ^ 9^1v ^ 2 d(V 

-1^ e^(y.tOo)+woE^(>',a)o) "9^ ■ 

Using the appropriate substitutions from order e- and order e, this equation can be written 

9"A3v d~Ax\ ,-T 

ar ^8zaz2 dx^- 


^ + {[A-'{V, coo)]' + Hy. Oio)k"(y, t«o)} -^ - 2mioP[ 

3 [31nE!j^(y.coo)] 3-^1, ainEii^(>',coo) 3^2^ 

3(^ dy 3X37 3v W' 


With the substitution of C.55, the terms on the left-hand side of equation C.68 containing A 3 v become 

3"Ai, 3"A3v , *>, ,-r 

^ • -; ~H—5"^;—h — 

dz- 3 v- 


d~A‘ir 3"A3^ -» ,--r 


+ 2/ - A-(y,a)o)A'(y.coo)] ^ 

- I [A'(y.a)a)]--A-(y,coo)r^ ^ 

- 2 [pTF-pTP-' -tL(y,( 0 „)A'(y,( 0 u)j f;7^'(y,coo)^^. 


Now. equation C',66 reduces to 


3*^3, d-A\. ,1, ,-t 

■yr + y' ^ 

rTE 5'^lv , oTE' a'^l< o pTE ^'^l t _ a~Al.i _ d-A I t 

32,^*^ 37 322 3X2 322 


,.pTFBTE',,TE'. ^ 

-h_p p U, (\,(Oo) 


(C.65) 


(C.66) 


(C.67) 


(C.68) 
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3lne^(>’,coo) 91ne^(}',(0o) 

^ dX^~ dy dX 

-2co5po^[f, 

which has homogeneous solution Applying the Fredholm alternative theorem to the right-hand side of 

equation C.68 gives the following condition 


iTE I oTE' ^^P . PlP 

[ azj ^ dr azj j 


■2|3™ + I ur’{y.n,]uPty.m)<ly 


[uJ%’.(i)o)]-dy 

a-Ap a-A|E] /-r.-TE/ M-j 

"arr + ir i 


ar-jy 

-h 2nokj |Ap|'A|^y^«;(y) [t/p(y,coo)]‘*rfy 

-E 2nok} {2A |5™|-Ap + Y -aPV-'IP™-^^!^} x 

j nzLv) {[v;™(y.a)o)t/,p(y,coo)]> [v;.™(y,(Oo)f/7^(y,a)o)]'}<v 

f 3 v 7 “(,.,<o„)y™(y,<o„)rfy 

dXdT J dox) [ dy 

+ /■!!7!|!^l-,TM(y,C„0)y7E(y,0,0)<V = 0. 

ax J dy 


Using equation C.49. the Ap terms with Z; derivatives are written 


.orTE £ji!_ 
dZ,dT 


I Ur'uy-dy = ^ I Uj^'uj^dy 


■ 3-BP’ ,[p™-pTC), / Uj^'uy-dy J V^Uridlmlj^ (y, mu)/ay]rfy 
'dXdT^ 


v7«yJ'=|Slne!^(y.«l.)/8yKv 

' I rfy I [ 5^%^] rf, 

a“/4) I _J_ L _ 

aX= 4PT1-PTMJ[VTM]P/V 

Finally, using these expressions and defining the effective diffraction and nonlinear coefficients, the evolution equation becomes 


(C.69) 


(C.70) 


(C.71) 


aAl^ ^^Pl TE^"'^ 


dr dz. 


Ll+oTE^^-pTEpTE"^ 


TE-spni TE-xpm 

+2A.= 7l^lApfAP + 4A7^^ 


dr- 

a|bP1“aP = o, 


(C.72) 


where the phase-dependent terms arising from the .v projections of the TM mode have been dropped. This is Justified because 
the terms that show up at this order are significant over the distance scale Zs (and farther), and the phase dependent terms 
are assumed to have coherence length Z^coh ~ • ^nd therefore average away for NLS propagation. For the same reasons, the 
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phase-dependent terms in the nonlinear polarization are also dropped, because they have a phase factor associated 

with them, and coherence length Lcoh/2- 
The diffraction coefficient is defined as 


dyJV™Uj^ dy 

4pTEpTM J [yTE]2^^ J [Vjufdy 


(C.73) 


which has primary contribution from linear homogeneous diffraction and an additional contribution arising from the linear 
coupling between the two waveguide modes. This latter contribution is assumed small and a taken as unity. In the case of a 
step-index waveguide, this assumption is well justified by noting that the derivative of the dielectric function results in a delta 
function, which, in the integrals in the numerator, serves to sample the value of the product of the modal envelopes at each 
interface. With a finite number of interfaces, these contributions will be much smaller than the continuous summation given by 
the integrals in the denominator. 

The effective self- and cross-phase modulation coefficients are defined 


TE-spm 


TE—xpm 

tu 


_ //>:(>•) [l/7^(y.o)o)]^^y 

/[f/TEJ-rfv 

_ / ni (y) [Uj^jy, (Op)(y, cop)]' dy 
J[Uj^]-dy 


(C.74) 


(C.75) 


TM TM ^ 1 '^ 

Note ih.ii in writing the effective nonlinear coefficient for cross-phase modulation that it was assumed that V. <SC Vy , Ux , 
which IS \alid for weak guidance. Due to the overlap between the TE and TM modes, the effective ratio between^the self- 
ph.ise m>>diilation and cross-phase modulation coefficients may not equal the intrinsic value 2A, because, in general, ;i, 


Takinc the results from this and the previous order, the nonlinear wave equation for the TE mode in laboratory units is 


3; ^ dt 

TE-sprn 


RTERTE"fd!! 

^ dt- 


laThi: .TE^ i,:": 


TE-xpm 

-Ale™|-ATE: 


(C.76) 


TM Mode 


IVrlorming the same steps w ith the .v-component of the magnetic field results in 
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(C.77) 
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which has homogeneous solution ^ 3 ^ «= Applying the Fredholm alternative theorem to the right-hand side of 

equation C.77 gives the following condition 


, dBj^ 

9Z, 07 dZn 


+ [^-{[p™']%P™P' 


- l[V™{y,a^)]-dy 


(C.78) 


iTM 


; 5 :dTM ;) 2 dTMi ^ 

^ + P™'^1 / V:,™'(,,coo)V,™(,.cooKv 
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+ ^{2AP™ |aP|'b™ + Y [2P^^- P™] [A™]'B™.,2,[pTE_p™],| ^ 

/« 2 (v) V.(Oo)] ' V™(y,Oio)V™{y, (Oo)dy 




Ti™3Ap ,[bte_btm]. /•dlnE!^(v.tOo)^^^g 

dX ^ J dy 


i/v (Y.wo)V,™(y.a)o)<v = 0, 


where again it was assumed that V™ Uj^ such that the c component of the nonlinear polarization P]. can be neglected. 

As before, using equation C.53, the 5™ terms with Z\ are rewritten, allowing the solvability condition to be reduced to 


o gTM r_J_ rTM' j 

^ ^ [ 0 Zj 07 ^ 0 Z, J ■ ^ 0 X- ^ 07- 


(C.79) 


TM-.spin 




TM-xpm 


where the phase-dependent terms arising from the.r projections of the TE mode have been dropped. The phase-dependent terms 
in the nonlinear polarization are also dropped. The diffraction coefficient is defined as 


'' A ii\ 


dyjUj^Vj^' 


(y.ojo) 

d\ 


/ ./y / f/JEv/TM 

= 1- ^ - i -,-L-J-, (C 

which is again taken as unity, and the effective .self- and cross-phase modulation coefficients are defined 

..TM-spm 

^ - (C 

/[l-TM]-rfv 

..TM-xpm _ /":(>•) [^-^■'"(v.a),,)]'v;™(v,ioo)K,™(>'.a)o)<v 

/In - =-. ((_ 

/[V7M]-Jv 

Taking the results from this and the previous order, the nonlinear wave equation for the TM mode in laboratory units is 

2'P™ f ^ + P""’' ^1 + ^ - P™P™" ^ (C 


(C.80) 
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Final Report AFOSR F49620-95-1-0431, Spatial Soliton Interactions, S. Blair & K. Wagner, U. Colorado, Boulder 164 

As mentioned previously, these results are obtained strictly for the purposes of illustration of the effects of tfansverse 
guidance, which, in the weak guidance approximation, results is effective linear and nonlinear coefficients. The modified NLS 
equations are not used for the simulations directly, but the effective linear coefficients are evaluated in Appendix D for the 
purposes of obtaining the birefringence beat length and the group-velocity walkoff length. 




Appendix D 

Optical Properties of Fused Silica 


The numerical simulations presented in this thesis make use of the material parameters of fused silica. The reasons tor this 
choice are numerous: fused silica is highly transparent in the visible and infrared with small linear and two-photon absorption 
and ultrafast non-resonant nonlinearity, has anomalous group-delay dispersion at the telecommunications wavelengths around 
A/ = 1.55 /rm, has high damage threshold, is well characterized, and has well-developed technology for fabrication into slab 
and cylindrical waveguide devices. The main downside, though, is the relatively small nonlinear coefficient. Because of this 
fact, other nonresonant nonlinear materials, such as AlGaAs and PTS, will ultimately need to be studied, but fused silica serves 
as a useful material on which to base initial investigations and the results obtained should be generalizable to other materials. 

The first section discusses the optical properties of fused silica in bulk, while the second section looks at glass slab waveg¬ 
uides, in which the linear properties are modified due to transverse confinement. 


D.l Properties of Bulk Fused Silica 

For comparison, linear and nonlinear properties of fused silica and other candidate materials are shown in Table D. 1. The other 
materials are presented here because optical properties are used in Chapter 5 to derive figures of merit for use in spatial soliton 
switching. 

For the spatio-temporal simulations, the linear index dispersion and nonlinear Raman dispersion need to be discussed. 
Appendix B covers the Raman response function and Raman susceptibility spectrum for fused silica. The linear index dispersion 
is typically given by a Sellmeier-type relation of the form 


■> , 4- Bik- 

n-(k) - 1 + X rTI3' 

j=\^' 


(D.l) 


where kj are the resonance wavelengths and Bj are the associated oscillator strengths. Table D.2 lists the coefficients for fukd 
silica |30]. 


Malcnal 

fused silica 

39Ph silicate 

PTS 

A1 igGag2As 

reference 

[30] 

(1561 

[125] 

[127] 

M) {pm) 

1.55 

1.064 

1,6 

1.55 


1.444 

1.774 

1.8 

3.4 

*0' (ps-/pm) 

-2.79 X 10“** 

1.17 X lO-’ 

> 0? 

1 X 10-^ [157] 

a (cm"‘) 

0.1 

0.1 ! 

O.S*’ 

0.15 

n{ (cm"AV) 

3.3x lO-'o 

2.2 X 10-'^ 

2.2 X 10-'- 

1.2 X 10-'^ 

P'i (cmAV) 

5.5x10"'^ 

7.2 X 10-'-* 

<5.0x 10-“^ 

siO^ 

ni'(cm'*W-) 

7 

•) 

-8 X 10-“ 

-5x 10--^ [143] 


“ absorption at Ay = 1.3^im used [158] 

* three-photon absorption is estimated at P 4 = 0.055 x 10~'* cm^/W [127] 

Table D.l: Linear and nonlinear properties of promising materials for nonlinear optical switching. The values for the linear 
absorption coefficient are valid for slab w'aveguide geometries and are dominated by waveguide scattering. 
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J 

resonance wavelength Xj (pm) 

oscillator strength Bj 

1 

0.0684043 

0.6961663 

2 

0.1162414 

0.4079426 

3 

9.896161 

0.8974794 


Table D.2: Sellmeier coefficients for fused silica. 


A plot of index dispersion over visible to near IR is shown in Figure D.I.' The refractive index n determines the phase 



0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 

wavelength A (in jum) 


Fiaure D. 1: Linear refractive index « and group index for fused silica. The zero dispersion wavelength = 1.27 pm divides 
tile boundar\ between the normal dispersion regime (X < X^j) and the anomalous dispersion regime (X > X^j). The curves are 
marked fur their values at 1.55 pm. 

velocity v,, = c'//i. where r is the speed of light in free-space. The material propagation constant X = m/c is also referred to 
here a.s the pha.se delay coefficient, since it is related to the inverse phase velocity. Of more relevance to this thesis, though, 
are the group velocity and the group delay coefficient. The group velocity is the velocity at which a quasi-monochromatic 
wavepacket travels, which is in general different than the phase velocity, and is most easily calculated from the group delay 
coefficient. The group delay coefficient is the frequency derivative of the phase delay; 


where n^. is the group index. Now. the group velocity is 


dm ~ c 


on 

n+O)^ 

dm 



(D.3) 


The group index is also plotted in Figure D. 1. showing that, over the spectral range plotted, the group velocity is less than the 
phase velocity, which is generally true away from resonance. 
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Another useful relation is the calculation of group delay based on the change of index with wavelength. Noting that 

iTtc d X- d 


(0 = 


3(0 27tc dX ’ 


the group delay can be written 


c 


X— 

^dX 


(D.4) 


(D.5) 


Group-delay dispersion describes the variation in group delay across a wave packet spectrum and is responsible for pulse 
broadening in linear propagation. The group-delay dispersion coefficient is calculated 

d-k 1 


k" = 


3(0- 


3/1 3-/11 

3(0 3(0- 


X^ 3-/I 


(D.6) 


~2nc^-dX^' 

Figure D.2 plots group-delay dispersion over the wavelength range 0.5 - 2.0 ^m. The zero-dispersion wavelength is also shown 



0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 

wavelength X (in /um) 


Figure D.2: Group-delay dispersion k” for fused silica. The zero dispersion wavelength = 1-27 divides the boundary 
between the normal dispersion regime (X < X^d) anomalous dispersion regime (X > X^d)- The curve is marked for its 

value at 1.55//m. 


on this plot to emphasize that, since group-delay dispersion is zero at this location, a wavepacket should propagate without 
change. Of course, this is not strictly true, because any wavepacket has nonzero bandwidth and will broaden because the slope 
(called third-order dispersion) and curvature (fourth-order dispersion) of group-delay dispersion are nonzero at this wavelength. 

Higher-order dispersion becomes important for broad temporal bandwidths. The simulations of spatio-temporal solitons in 
this thesis use the higher-order dispersion coefficients because such broad bandwidths are present. Third-order dispersion is 
calculated from 


^ = 1 
00 )-^ c 



+ 0 ) 


3(0^ 


(D.7) 


X^ 

471 “ 


dX- 




3Xil 

3A.3 
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Fourth-order dispersion is calculated from 


k"" = 


3 ( 0 '’ 


3 -^« d‘*n 
*^3(0^ '^*^3co'‘_ 
3“/! o, 


(D.8) 


8ji^c^ 


Plots of third and fourth-order dispersion for fused silica are shown in Figure D.3. 




I 'lyurc 1)3 1 !ti:hcr-iirder dispersion for fused silica. Third- (solid curve) and fourth-order (dashed curve) dispersions are plotted 
versus \va\eicn^(h. The curves are marked for their values at 1.55 pm. 


As a summary of this section, the appropriate dispersion coefficients for bulk fused silica at the communications wavelength 
A = 1 55 pm arc listed m Table D.3. 


coefficient 

bulk SiO: 

units 

fh) 

1.444 



5.854 

l/pm 


4.875 

fs/pm 


-0.02791 

fs“/pm 

N) 

0.1511 

fs'VT^rn 

i.iiii 

^0_ 

-0.4931 

fs'^/pm 


Table D.3: Calculated dispersion coefficients for bulk fused silica at?i= 1.55 pm. 


D.2 Properties of Glass Waveguides 

The waveguide geometry of interest is shown Figure C.l. This sections derives the TE and TM eigenvalues and modal profiles 
in this three-layer slab case, and evaluates the modal phase delays and P™ and the effective linear coefficients derived in 
Appendix C. 






Final Report AFOSR F49620-95-1-0431, Spatial Soliton Interactions, S. Blair & K. Wagner, U. Colorado, Boulder 169 

A cross-sectional view is shown in Figure D.4. Since the structure is piece-wise separable, the following definition is used 
which represents phase delay in each layer 

k] = tyrpQtj = 

where = tj. Note that it is assumed that the magnetic permeability is the same in all regions. 



Figure D.4: Cross-sectional view of three-layer slab waveguide geometry. In general, the refractive indices of the two cladding 

layers are different, leading to an asymmetric structure. 

D.2.I Derivation of TE Mode 

The solution for the TE mode must satisfy equation C.23. repeated here for convenience 

= {[P'^‘-']'-it-(y.a))}l/™{y.a)). (D.IO) 

along with the boundary conditionsC.24. It is clear that, depending on the size of relative to A-(y.(o), equation D.IO has 

two types of solutions: 

=aie“''when [P^*^]'> l;"(,v.a)) (D.ll) 

4-hze'""' when [p^^]‘< A-*(y,a)). 

The transverse wave numbers are defined 

ar=[pT‘-f-A-(y.( 0 ) (D.12) 

It is clear that in order to satisfy the boundary condition of finite one-dimensional power, the modal profiles must decay in 
the cladding regions, implying that the effective longitudinal wave number P’’'^ is greater than the bulk propagation constant of 
the cladding. This also implies that, in order to satisfy the continuity boundary conditions, the field in the core region cannot 
decay, such that the effective longitudinal wave number is less than the bulk propagation constant of the core. 

With that in mind, the following piecewise solution is posed 

aifi-“i>.'' y>d/2 

£»cos(K 2 v.v-l-<t)) —d/2<y<d/2 

flic”’’' y <—d/2 



Uj^(y.tn) = 


(D.13) 
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where the solution in the core region is written in the form of a cos() function without loss of generality. Note that the non¬ 
decaying solutions in the cladding regions have been eliminated. Imposing the boundary conditions at y = d/l yields the 
following constraints 

= ^cos(K2y^//2-|-(t)) (E)-14) 

=-K 2 v^ sin (k2v^//2+ ([)), (DAS) 


which can be divided to obtain 


Similarly for the fields at y = -d/2, 


—^ = tan (K 2 yd^/ 2 + ([)). 
K 2 v 


aze = bcos {-K 2 yd /2 -f (J)) 

= -K;vl»sin(-K2j.d/2-f <t)) 

=> — = tan(K 2 vd/ 2 -(|)). 

K 2 v 


(D.16) 


(D.17) 


Now there are two equations and one unknown - <>. Forcing both equations to give the same value for 0 results in the 
condition 

IaTE , IaTE 

— 4- nTT m.is^ 


2^1 + ::^03 ^ ^2yd ipTT, 

where p is a nonzero integer and the phases are defined 

(t)p = 2tan’'‘ (aiy/K2y) = K2 v^/+ 2(1) ±/in: 
(t)p = 2tan” ‘ (a 3 v/K 2 y) = y^iyd - 2(1) ± mn. 

Defining the effective index 

K:v = 

the equation for the modal eigenvalue can be written 

/II? — »i-_ — — d)T^ = 2nJi. 


2kod^n^ - n-ff - 0f - 0^ = 2pn, 

which can be solved numerically via the bisection method. The phases are rewritten as 


0 !^ = 2 tan”' 


0T = 2 tan" 


f~^ ^ 

- n-a 


(D.18) 

(D.19) 

(D.20) 

(P.21) 

(D.22) 

(D.23) 

(D.24) 


With ;icii. and hence known. K 2 v. ttiv, a.v. and 0 can be calculated. From this information, the overall profile can be 
calculated 

{ cos(K 2 v<// 2 -l- 0 )e y>d/2 

Uj^(y, (u) = 0 < cos(K 2 v.v 4-0) —d/2 <y <d/2 (D.25) 

I, cos (K 2 vd /2 - 0)e“3>l'’+‘^/-l y < -d/2 

where b is an arbitrary (real) constant. In Appendix C. this constant is taken as unity. 


D.2.2 Derivation of TM Mode 

The solution for the TM mode must satisfy equation C.28, repeated here for convenience 

a-K7'^(y.tOo) 8lne^(y,coo)3v7M(y,ci)o) _ 
dv- dy dy 

{[P™]'-i 2 (y,coo)}F,™(y,(Oo), 


(D.26) 
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along with the boundary conditions C.29. When taking the boundary conditions into account at each interface, the eigenvalue 
equation to be solved in each homogeneous region is [159] 


d^-V™iy,(Oo) 

dy^ 




which has the same form as for the TE mode. 

Following the same steps used in the previous section yields the equation for the TM effective index 

I j aTM aTM _ t 


IkodJnl - nl^ - - <t)I^ = 2pn. 


(D.27) 


(D.28) 


The phases are defined 


j)™=2tan-' 


(ttj= 2 tan' 




n\\hr^-K 


: K 2 v^ + ± nn 


= K 2 v^ ” 2(t)' ± nin, 


and the overall profile can be written 


V,™(y,0)) 


cos (K 2 v ^//2 + (!)>-“»y > dl2 
cos(K2v.v + 0*) —dl2<y<dl2 

cos (K:v<f/2 - o')e“3vt'+‘//-l y < -r//2 


D.2.3 Linear Waveguide Dispersion Properties 


(D.29) 


(D.30) 


(D.31) 


This section calculates the dispersion coefficients for the TE and TM modes, taking into account both the material and waveg¬ 
uide dispersions. The waveguide structure is assumed to be three-layer, as illustrated in Figure D.4. 

The Sellmeier coefficients for three glass compositions [ 160] are given in Table D.4. These compositions are representative 


coefficient I S 1 O 2 Ge 0 ;:Si 02 (0.135:0.865) | B 20 i:Si 02 (0.133:0.867) 



0.069066 


0.115662 


9.900559 


0.696750 

0.408218 

0.890815 


0.064270 


0.129408 


9.425478 


0.711040 

0.451885 

0.704048 


0.061900 


0.123662 


9.098960 


0.690618 

0.401996 

0.898817 


Table D.4: Sellmeier coefficients for representative glass compositions suitable for waveguide fabrication. 

of glasses which are common for fiber and integrated waveguide devices. Because of the variability of glass properties between 
different melts and processing conditions, the coefficients for fused silica (SiOi) in this table are slightly different than those 
listed in Table D.2. 

The three most important effective waveguide quantities to this thesis are the difference in the phase delays between the 
TE and TM modes, the difference in the group delays, and the effective group-delay dispersion coefficients. The first quantity 
leads to a beat length between the modes defined by 

u , -__=_ h. -- (D.32) 

'‘---IpTE.pxM, 

When the propagation distance is much greater than the beat length, then phase-dependent terms in the nonlinear polariza¬ 
tion can be neglected as done for the simulations in Chapters 5 and 6, and the phase-dependent linear couplings derived in 
Appendix C for the slab waveguide geometry can also be neglected. 







Final Report AFOSR F49620-95-1-0431, Spatial Soliton Interactions, S. Blair & K. Wagner, 17. Colorado, Boulder 


'172 


The second quantity leads to temporal walkoff, defined by 

Twaikoff = AP'z, (D.33) 

where AP' = - P^"^' is the differential group delay coefficient, z is the propagation distance, and the effective TE and TM 

group delay coefficients are defined by equations C.50 and C.54. When the walkoff time is greater than the temporal duration 
of interacting pulses, then the interaction distance is limited to the walkoff distance, as defined by equation 1.33. As shown 
in Chapter 6 though, this birefringent walkoff can be beneficial in order to counteract the temporal walkoff due to the strong 
Raman down-shift of the pump solitary wave. 

The final quantity, group-delay dispersion, is only important for the spatio-temporal studies in Chapter 6. In this case, a 
negative group-delay dispersion (AGDD) coefficient allows for the formation of a bright solitary wave. The magnitude of this 
coefficient determines the ratio between transverse width and duration (and spatial width), with small magnitude leading to 
short duration and large magnitude leading to long duration. 

These three quantities for X = 1.55 pm are plotted as a function of core thickness in Figures D.5 and D.7 for symmetric 
waveguides, and in Figures D.6 and D.8 for asymmetric waveguides, with air serving as the upper cladding layer. The waveguide 
compositions shown consist of Si02 core with B 203 ;Si 0 ; cladding, and Ge02:Si02 core with Si02 cladding. These figures 
indicate that the use of asymmetric guides leads to shorter beat lengths and greater differential group delays. 

For the spatial soliton logic gates studied in section 5.4, the gate lengths were 2.63 cm, indicating that /i^at ^ 2.63 cm. 
Numerical simulation has shown that the phase-dependent effect of the vectorial four-wave mixing terms is nearly eliminated 
when the device length is greater than about 5 /teat- Therefore, for the spatial dragging gates, /beat < 5 mm. This criterion is met 
bv the asvmmetric waveguide of Figure D.8, for core thicknesses of about 4.5 ^m or less. 

The spatio-temporal logic gates of section 6.3 have gate lengths of 8 Zb = 13.2 cm, so that /beat < 25 mm. This constraint can 
be met b\ all waveguide structures except for the first symmetric waveguide with SiOi core. There is an additional constraint in 
(he spatio-temporal case, however. As discussed in section 6.3, due to the Raman down-shift, in cascaded operation, group delay 
birefringence is necessary to equalize the group velocities of the downshifted signal and clean pump in all cascaded stages. This 
differenlial group delay is about 2.3 to 2.8 fs/^m, which can only be obtained by the second asymmetric guide of Figure D.8 
with core thickness about 2 pm. Note, though, that the first asymmetric guide nearly fulfills this additional requirement as well. 



Final Report AFOSR F49620-95-1-0431, Spatial Soliton Interactions, S. Blair & K. Wagner, U. Colorado, Boulder 



waveguide thickness (in 



waveguide thickness (in /i-ni) 


Figure D.5; Linear properties of silica (SiO^) core, B 203 ;Si 02 symmetric clad slab waveguide versus core thickness. All 
quantities are evaluated at k/ = 1.55 pm. The birefringent beat length /beat is defined by equation D.32, and the differential 
group delay Apo leads to temporal walkoff defined by equation D.33. For comparison, the GDD coefficient for the core 
material is -0.02745 fs’/pm. and -0.03865 fs-/pm for the cladding. 






Final Report AFOSR F49620-95-1-0431, Spatial Soliton Interactions, S. Blair & K. Wagner, U. Colorado, Boulder 


174 



3 4 5 6 7 8 

waveguide thickness (in jum) 



61 


-0.028 

-0.030 

-0.032 

-0.034 

-0.036 

-0.038 

-0.040 



3 4 5 6 7 8 

waveguide thickness (in /xm) 


Figure D.6: Linear properties of silica (SiO;) core. B;0i:Si02 lower clad and air upper clad slab waveguide versus core 
thickness. The dispersive properties of air are neglected for simplicity. All quantities are evaluated at Xf = 1.55 pm. The 
birefringent beat length /t«ai's defined by equation D.32. and the differential group delay Apjp leads to temporal walkoff defined 
by equation D.33. For comparison, the GDD coefficient for the core material is -0.02745 fs‘/pm, and -0.03865 Is^/pm for the 
lower cladding. 
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waveguide' thickness (in ^m) 
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^ - 0.022 

-0.024 

£ -0.026 
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2 3 15 6 7 8 

waveguide thickness (in /urn) 

Fijiurc D.7; Lincur properties ot Ge02.Si02 ci^re. (SiO^) symmetric clud slub waveguide versus core thickness. All 
quantities arc evaluated at /./ = 1.55 pm. The birclringcnt beat length /^cat is defined by equation D.32. and the differential 
group delay AP,', leads to temporal walkoff defined by equation D.33. For comparison, the GDD coefficient for the core 
material is -0.01733 and -0.02745 ts~ //jm lor the cladding. 
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Figure D.S: Linear properties of GeO;:SiO; core, silica (SiO;) lower clad and air upper clad slab waveguide versus core 
thickness. The dispersive properties of air are neglected tor simplicity. All quantities are evaluated at Xf = 1.55 pm. The 
birefringent beat length is defined by equation 0.32, and the differential group delay AP^ leads to temporal wal^koff defined 
by equation D.33. For comparison, the GOD coefficient for the core material is -0.01733 fs'/pm, and -0.02745 fs-/pm for the 
lower cladding. 





